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INFINITE LINEAR DIFFERENCE EQUATIONS WITH 
ARBITRARY REAL SPANS AND FIRST DEGREE 
COEFFICIENTS 


BY 
R. H. CAMERON AND W. T. MARTIN 


1. Introduction. Finite difference equations with arbitrary real spans have 
been investigated by various mathematicians('). In this paper we shall con- 
sider infinite difference equations with arbitrary real spans whose coefficients 
are either constants or linear functions of the independent variable. 

We shall state at the outset three theorems which give some indication 
of the scope of our work. Theorems 1, 2 and 3 are special cases of Theorems 1a, 
2a and 3a which will be proved in the body of the paper. We state these spe- 
cial cases explicitly because they are simpler and can be expressed in the usual 
terminology of difference equations. The three theorems are not independent 
since 1 and 2 together imply 3, and 2 is a special case of 3. Nevertheless, it 
seems worthwhile to quote 1 and 2 separately because of their greater sim- 
plicity and because they form the natural introduction to Theorem 3. They 
deal, respectively, with the case of constant coefficients, the case of all coeffi- 
cients constant but one (which is linear), and the case of all linear coefficients. 


THEOREM 1. Let 


(1.1) anf(s — An) = g(2) 

n=l] 
be an infinite difference equation in which the d, are arbitrary real numbers and 
the a, are complex numbers such that 


(1.2) > a,e"” 


converges absolutely to a non-vanishing function A(w) in some strip a<Re w 
<B. Furthermore let g(z) be analytic in some strip a<Im z<b. If for some 
€>0 inequalities 

Presented to the Society, October 31, 1942; received by the editors June 24, 1942. 

(‘) A treatment of a very general nature has recently been given by T. Kitagawa, On the 
theory of linear translatable functional equations and Cauchy's series, Jap. J. Math. vol. 13 (1937) 
pp. 233-332. Kitagawa gives other references to the literature. The methods used in the treat- 
ment of the first theorem of the present paper are more closely related to those of S. Bochner, 
Allgemeine lineare Differ ichungen mit asymptotisch konstanten Koefizienten, Math. Zeit. 


vol. 33 (1931) pp. 426-450, and W. T. Martin, Linear difference equations with arbitrary real 
spans, Acta Math. vol. 69 (1937) pp. 57-98. The other theorems use somewhat different 
methods. 
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| g(x + iy) | <€ for x <0 


a<y<d’ 
| g(a + iy) << for x> of 


(1.3) 


hold for every a’, B’, a’, b’ in 
(1.4) a<a<ate B-e<p a<ad< <b, 
then it follows that (1.1) has one and only one solution f(z) of the same character, 
that is, analytic in a<Im 2<b and satisfying 

| f(x + iy)| < for x<0 


| f(x + iy) | < Ka pay for x>0 


(1.5) 
for every a’, B’, a’, b’ in (1.4). Moreover f(z) is given explicitly by the formula 
f(z) = a’ g(z — a<Imz< 

n=l 


1 
, nw 2 
Lane a < Rew < B(?). 


THEOREM 2. Lei 


(1.6) af(s) = bafls — da) + 


be a difference equation in which the ,, are arbitrary real numbers and the b, are 
complex numbers such that 


converges absolutely to a function B(w) in some strip a<Re w<B of the aux- 
tliary w-plane. Let yo be the imaginary part of the constant term of (1.7) (zero 
if it has none). Then if g(z) is analytic and satisfies (1.3) where the strip 
a<Im z<b does not contain the line Im 2= yo, it follows that (1.6) has a unique 
solution f(z) of the same character, that is, analytic in a<Im 2<b and satisfy- 
ing (1.5). 


THEOREM 3. Let 


(1.8) Dd + bn) f(z — rn) = g(2) 


n=l 


(?) Throughout this paper two complex planes will enter into consideration, a z-plane 
(z =x++1y) in which the independent variable in our functional equation ranges, and an auxiliary 
w-plane (w=u-+-iv) in which we consider a= associated function which governs the growth of 
the solutions in the z-plane. 
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be a difference equation involving arbitrary real spans and complex linear coeffi- 
cients such that 


(1.9) > and (bn + 
n=l n=l 

converge absolutely to functions A(w) and B(w) in the strip a<Re w<B, the 

Sunction A(w) being non-vanishing in the strip. Then 


1 ytTi B(w) 
yo = — im lim — aw 
Tid, A(w) 


will exist and be independent of y (a< <8); and if g(z) is analytic and satisfies 
(1.3) where the strip a<Im 2<b does not contain the line Im z=yo, it follows 
that (1.8) has a unique solution f(z) of the same character, that is, analytic in 
and satisfying (1.5). 


The difference equations which we have just stated can all be expressed 
in terms of Stieltjes integrals. The difference equations are therefore special 
cases of certain Stieltjes integral equations, which are dealt with in Theo- 
rems 1a, 2a and 3a. These theorems are proved by means of the theory of 
Fourier-Stieltjes transforms. Theorem 1a which depends on Pitt’s lemma(*) 
is in a certain sense not a true Stieltjes integral theorem since the Stieltjes 


integral involved is restricted to have no singular part, and the equation is 
thus really only an infinite integro-difference equation. The same is true of 
Theorem 3a insofar as it depends upon Theorem la. No such restriction is 
required in Theorem 2a, and it is therefore strictly a Stieltjes integral equa- 
tion theorem. 

If we form a right-continuous step function a(A) with a(0)=0 and with 
jumps of a, at points A,, 


(1.10) 


then the condition that the series (1.2) be absolutely convergent in a<Re w 
<B can be expressed in an alternative but equivalent form. 


Lemma 1.1. A necessary and sufficient condition that the series (1.1) be ab- 
solutely convergent in a<Re w<f is that 


(*) H.R. Pitt, Mercerian theorems, Proc. Cambridge Philos. Soc. vol. 34 (1938) pp. 510-520. 
See Lemma, p. 513. See also I. Gelfand, Uber absolut konvergente trigonometrische Reihen und 
Integrale, Rec. Math. (Mat. Sbornik) N.S. vol. 9 (1941) pp. 51-66. Equation (3.4) of Theorem 1a 
has been solved by Pitt on the real line in the above paper and in General Mercerian theorems 
(II), Proc. London Math. Soc. vol. 47 (1942) pp. 248-267. 


> an, = 0, 

a(A) = 

| 
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0 
1.11 da(n) | da(r 
(1.11) fie + fi | daa) | < 


for every a’, B’ in a<a’' <p’ <B. 


Also the sum-function A(w) of (1.2) is given by A(w)= S°edaQ), 
a<Rew<f. 
The growth condition (1.3) can also be expressed in an alternative and 


equivalent form, as we shall show. 


LemMA 1.2. Let g(z) be analytic ina strip a<y<b, —2o <x<@, and let 
a, B, € be given real numbers with e€>0. Then a necessary and sufficient condition 
that (1.3) shall hold for every a’, B’, a’, b’ in (1.4) ts that 


0 
(1.12) sup g(x + iy)|ax+ f | g(x + iy) | 

a’<y<d’ 0 
hold for every a’, B’, a’, b’ in (1.4). 


Lemma 1.2 is an immediate consequence of the following property of 
analytic functions, which is based upon the Phragmén-Lindeléf principle: 


Let h(z) be analytic in the half-strip 

(1.13) cs Imzsd, 0 Ss Rez < ~, 

and let 

(1.14) is) | a, cSysd. 

0 

Then for every strip (c’, d’) interior to (c, d) we have 

(1.15) | h(x + iy)| < Moe, 
Proof. Let z be any point in (1.13) and define 

(1.16) H(z) = f h(t)dt. 

The function H(z) so defined is analytic in (1.13) and is independent of the 


path as long as the path lies in (1.13). On taking the path to be the sum of 
two straight-line segments (ic, i Im z) and (4 Im z, 2) we find 


s f | | dy + | + iy) | dus Ma + 


Thus H(z) is bounded in (1.13). Moreover by (1.14) we see that H(z) ap- 
proaches a limit as z approaches infinity along any one horizontal half-line 
in ¢SIm zd. Thus by a well known form of the Phragmén-Lindeléf theo- 
rem H(z) approaches one limit value uniformly in the strip (1.13). 
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Next if c<c’<d’'<d and 6 is a sufficiently small positive number, then 
for c’<y<d’' and 6<x, 


Me + iy) = 
(¢ — 2)? 
1 H(s + be) 
J, do — (const.) = 0, 
as x—+>0, the convergence being uniform in c’<y<d’. Thus (1.15) holds. 
2. Norms and their properties. Before giving the necessary lemmas and 
definitions for our work on integral equations we will make a few remarks 
which indicate the motivation for the remainder of the section. In order to 
include the previously stated theorems on difference equations under our 
theorems on integral equations, as well as to have more generality in the 
integral equations themselves, we must dea‘ with functions p(A) of bounded 
variation on every finite interval but not necessarily on the whole infinite 
interval. Thus the expression S| dp(r)| need not necessarily be finite for the 
functions p(A) which we consider. We shall, however, impose a certain con- 
dition on the behaviour of p(A) which will enable us to form Radon-Stieltjes 
integrals(*) of the form SAA) dp) for certain classes of functions h(A). In 
fact we shall define a norm Ni(~) for right-continuous functions p(A) of 
bounded variation on every finite interval and we shall work with functions 
PA) of finite norm, Ni(p) < ©. If p(A) and g(A) are two such functions, each 
of finite norm, then the convolution Sa —)dp(u) need not necessarily 
exist, but nevertheless a certain modified convolution can be shown to exist, 
this modified convolution serves the usual purposes required for the ordinary 
convolution. For example, if P(w) and Q(w) are the Fourier-Stieltjes trans- 
form of p(d) and g(a), respectively, Q(w) 
then the product R(w) = P(w)Q(w) is the Fourier-Stieltjes transform of the 
modified convolution r(A) = p(A) Xq(A). 

_ We will also define a (weak) norm N(f) for analytic functions f(z) in a 
strip a<Im z<8, this norm being related to our Ni-norm, and we shall show 
in particular that the ordinary convolution Sf (z—rdp) exists and yields 
a function analytic in a<Im z<6 and of finite N-norm at most N(f)Ni(p), 
whenever N(f) and N;(p) are finite. The fact that the N-norm is a weak norm 
will be useful in our functional equations since it will enable us to treat 


(*) The integrals which occur in this paper are understood to be Lebesgue-Stieltjes (Radon) 
integrals. See, for example, Saks, Theory of the integral, 2d revised edition, Warsaw-Lemberg, 
1937, pp. 19 and 67. We note that the familiar principles of monotone and dominated conver- 
gence in the Lebesgue theory as well as the symmetric Fubini theorem are also valid in this 
theory; see pp. 28, 29, and 77. In this paper we shall not admit + © as members of our number 
system ; that is, existence implies finiteness. 
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analytic functions of finite exponential growth in a strip, the rate of growth 
being permitted to increase as the rate of growth of the function p(A) of 
bounded variation decreases. 

We now proceed with the necessary definitions and lemmas. Let two posi- 
tive continuous functions E(x) and E,(x) satisfying 


(2.1) E(x + y) S E(x)Ei(y), + y) S Ei(x)Ei(y), 


for all x and y, be given. These functions will be designated throughout the 
paper by this same notation. In practice, E(x) and E,(x) will usually be 


«#20, 


x20, 


E = 
x <0, { 


Be) = | 


with a<§. The reader can readily verify that these functions satisfy (2.1). 
In terms of definite preassigned functions E(x) and E(x), we define the fol- 
lowing two norms: 


DEFINITION 1. Let p(A) be a right-continuous function of bourded variation 
on every finite interval. Then we define the norm 


DEFINITION 2. Let f(z) =f(x+<iy) be defined in a strip a<y<b. Then we 
define the norms 


(2.3a) = f “E(a) | fle + iy) | dz 


and 
(2.3b) N(f) = sup N(f; y). 
a<y<d 


In practice the N-norm will be used for functions which are analytic in a 
strip. Nevertheless we use this general definition which applies to any func- 
tion of a complex variable, since on occasions we will use the finiteness of the 
norm to aid in proving the analyticity. 

We shall use the term “of finite norm” with reference to both functions 
of a complex variable and functions of bounded variation, with the under- 
standing that the appropriate norm will be used in each case. While there is 
a possible overlapping of these types of functions, in practice no confusion 
will arise from the use of the term “of finite norm.” 

We shall use the following properties of the norm, where the existence 
(finiteness) of the right side implies the existence of the left. 
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LEMMA 2.1. 
(2.4) Ni(pi + S Nil(pr) + 


(2.5) N(fi + fe; S N(frs y) + 9), 
(2.5a) N(fi + fe) S + N(f2), 


The inequalities (2.4), (2.5), (2.5a), (2.7) follow immediately from the 
definitions; so we proceed to the proof of (2.6). Obviously it is sufficient to 


prove 


and we may without loss of generality assume that y=0. Then we have 


s fl 490)| -»)| ax 


= f | 490) | f “E(x) | f(a)| de 


= Ni(p)N(f; 0). 


(It is clear from this argument that -|dpa)| and hence 
J_f(x—d)dp(A) must exist for almost all x when Ni(p) and N(f; 0) exist.) 


LEMMA 2.2. Let f(z) be analytic in a<Im2z<b, and let 6>0. Then for 
any point z=x-+ty in the sirip a+25<Im z<b—26 we have 


2 
(2.9) | #2) | s — 
|z—-&|<é 


This follows from the fact that f(z) is a harmonic function of x and y. Thus 


7 
> 
i 
| | 
4 
| 
} 
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1 
f(z) = f(u iv) duds, 


f(z) | | + iv) | 


< f flu + in) | 


po 
sup [aol f f | f(u + iv) | E(u)dudv 


<8 
sup [E(é)}"! N(f)do 
<6 


= 25 sup 


<6 
LEMMA 2.3. If f(z) is analytic in the strip a<Im2z<b and if N(f) and 
Ni(p) are both finite, then _f(z—d)dp(d) is analytic in the strip a<Im 2<b. 
To prove this, we shall show that 


M 
(2.10) fle — »)dp(d) = fle — »apr) 
0 


uniformly in z in the rectangle R3:a+25<Im z<b—26, A+65<Re z<B-—G, 
where A and B are arbitrary. Now by (2.9), (2.6), (2.2) we have in R;: 


M2 2 


2 M2 
xi A<E<B 


since we can consider the finite integral as an infinite integral with p(A) re- 
placed by a function which is constant outside the finite interval. Thus 
lim wi, My+0 *f(z—d)dp(A) =0 uniformly in and the uniform convergence 
in (2.10) is established. Since the finite integral is obviously analytic, the same 
is true of the infinite and also of the two-way infinite integral. 


LemMA 2.4. Let {f,(z)} be a sequence of analytic functions in the strip 
a<Imz<b such that 


nel 


Then the series 
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(2.12) f(s) = falz) 


n=l 


converges absolutely and uniformly in every closed finite region contained in the 
strip a<Im z<b and represents an analytic function f(z) of finite norm in the 
strip. Moreover 


(2.13) Ni) 
and if Ni(p) < then 
(2.14) f ‘| > — »| dp(d) = > f ¥y" — d)dp(a) 


where the right member converges uniformly in every closed finite subregion of the 
strip. 


The absolute and uniform convergence of (2.12) in closed finite subregions 
of the strip follows immediately from (2.9) and (2.11). Thus f(z) is analytic 
in the strip and we have 


= f + ax = f E(x) fale + iy) 


< fle t iy)| dx = 20) | fle + ae 


Moreover if h(z) =f" f(z—d)dp(d) and h,(2) =f fa(z—A)dp(A), and a+26 
<Im 2<b—26, then 


M-1 
| 


(2.15) | A(z) — DO : fale — A)dp(r) | 


2 

2 

[by (2.6) ] < — Ni(p)N ( sup [E(é)]-! 
n=M <8 


2 
[by (2.13)] sup 


[by (2.11)] —0as 
This yields (2.14). 


9 
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LEMMA 2.5. Let p(A) be a right-continuous function of bounded variation on 
every finite interval and of finite norm. For any values of \ and up we have 


1 
—u) 


(2.16) 
AO 


{ sup 

rE lor} 

where a2| denotes the closed interval with endpoints ay and ot (a 2 a2, <a). 
Proof. 


{ sup 
Ex(v) 


The second inequality in (2.16) follows from (2.1). Thus 
1/E,(v — w) S Ex(u)/Ex(r). 


LEMMA 2.6. If p(A) and q(A) are of bounded variation on every finite interval 
and of finite norm then the modified convolution 


\ wae). 


exists and is of bounded variation on every faite interval ond is finite norm, 

By (2.16) 


{ f 20 | dq(u) | 


\ Ex(r) bw 


Thus the integral in (2.17) exists (is finite). If p and g are both increasing 
functions then the modified convolution is increasing. It readily follows that 
(2.17) is of bounded variation in the general case. For (2.18) we have 


i 
| 
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f dy feo — — u) | 


[ve — — p(— | dg(u) 


-f “| | f Ex(u + ») | dp) | 


sf | | f Ex(u) Ext») | dp(o) | 


= Ni(p)Ni(q); 
which proves (2.18)(5). 

In some places it will be useful to have our functions normalized in the 
following way. We call p(A) normalized if (i) p(A) is right-continuous and 
(ii) p(0) =0. 

COROLLARY TO LEMMA 2.6. If p is normalized then the convolution p Xq is 
normalized. If p and q are normalized then the convolution is symmetric: 

Lemma 2.7, Let {pa(d)} be a sequence of normalized functions, each of 
bounded variation on every finite interval, such that 


(2.19) D Nilpn) < 


Then on every finite interval the series 


(2.20) P(r) = pald) 

1 
converges uniformly to a function of bounded variation, and is a function of 
jintte norm. Moreover 


(2.21) Nip) ilps). 


(®) To see that the interchange of order of integration just made is justifiable compare 
An unsymmetric Fubini theorem by R. H. Cameron and W. T. Martin, Bull. Amer. Math. Soc. 
vol. 47 (1941) pp. 121-125. Throughout the paper we shall frequently make use of the result to 
permit an interchange of order of integration of two integrals of this form. 
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Furthermore, if Ni(q) < ©, then 


(2.22) me 
= { — — Pa(— u)} dg(u), 


where in every finite interval the right member converges uniformly to a function 
of bounded variation. 


The uniform convergence of (2.20) follows from Lemma 2.5. That the 
sum is of bounded variation can be seen by separating into increasing parts. 
Thus if 


+ 
#0) = | dpa(u) | + pa(d) 
then 
Nilpa) S 


and 


por) = = D p20), 


n=l 


so that (A) is of bounded variation on every finite interval. 
Next, (2.21) follows for increasing parts: 


ad t t 
s[_mx 200] - 


—MS)3M 


—MS\<M 
+ 
—|¢(—M) M) 
1 
—0 as K— o, for fixed M. 
This yields 


M M 
= f < > Ni(pe). 
— n=l 


—M 


Letting M approach infinity we obtain (2.21) for increasing parts. For the 
general case, 


| 
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Ni(p) = Nilpt — S + 
1 1 1 


and 


nawk+1 k+1 


+2 D0 Nilpn) Nila) as &. 
1 1 


k+1 


This yields (2.21) in general. 
For (2.22) we use Lemma 2.5 and condition (2.19) and obtain 


— — pa(— w)| | dq(u) | 


—o n=l 
1 


This enables us to apply the principle of dominated convergence to obtain 
(2.22). That the right member of (2.22) converges uniformly in every finite 
interval to a function of bounded variation follows immediately from Lemma 
2.6 and the earlier part of the present lemma already proved. 

3. The difference equation with constant coefficients. We shall now treat 
equation (1.1). We obtain a slightly more general result than Theorem 1 in 


terms of an integral equation. 
THEOREM la. Let p(d) be a right-continuous complex-valued function of 


bounded variation on every finite interval, whose singular part is zero; let p be of 
jintte norm, 

(3.1) Ni(p) = | | + f dp0)| < @, 

and let p be such that the (analytic) function 

(3.2) P(u) = f 


is non-vanishing in the strip a<Re w<f. Let g(z) be a function analytic in 
the strip a<Im 2<b and of finite N-norm, 


0 
(3.3) N(g) = sup, | + f e*| g(x + iy)| < 


: 
f 
i 
| 
| 
| 
| 
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Then the equation 
(3.4) -nap0) = 


has one and only one solution f(z) which is analytic in the strip a<Im z<b 
and which is of finite norm N(f)< ~. 


For the proof let us notice that since P(w) is analytic and non-vanishing 
in the strip a<Re w<@ and since (3.1) holds it follows from a result of 
Pitt (loc. cit.) that 1/P(w) is expressible in the form 

1 
P(w) 
where p*(A) is a function of bounded variation on every finite interval with 
finite norm. Also(*) since 

P(w) 


- [ease oa 


(3.5) = f a<Rew<f, 


it follows that } we [p(v —u) —p(—p) ]dp*(u) is a function with a unit jump at 
the origin and constant elsewhere. Next we form the function 


(3.7) = fale — 


which by Lemmas 2.3 and 2.1 is analytic in the strip a<Im z<6, and of 
finite norm. Moreover 


f ‘fe — r)dp(d) = fin 


= g(z). 
Thus f(z) satisfies the equation (3.4). 
(*°) See Footnote 4. 
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For uniqueness let us assume that f is any solution of (3.4) of finite norm. 
Then 


that is, f= f. 

This concludes the proof of Theorem 1a, and also shows that the solution 
f(z) is given explicitly by (3.7). 

Theorem 1 is an immediate consequence of Theorem 1a in view of Lemmas 
1.1 and 1.2. 

4. The difference equation with one linear coefficient. We now deal with 
equation (1.5), and as in the previous case, we obtain a slightly more general 
result expressed in terms of an integral equation(’).” 


THEOREM 2a. Let p(Ad) be a right-continuous function of bounded variation 
on every finite interval and such that 


0 
(4.1) = f dp0)|+ dp(r)| < ©. 


Let yo=Im[p(0+0)—p(0—0)] and let a<Im 2<b be any strip whose closure 
does not contain the line Im z=yo. Let g(z) be a function analytic in the strip 
a<Im z<b and of finite norm: 


(4.2) = sup, | fe g(x + iy)| dx + f g(x + iy)| dx 
Then the equation 
(4.3) afte) = f fle — + 


has one and only one solution f(z) which is analytic in the strip a<Im z2<b 
and which is of finite norm, N(f)< @. 


We base the proof upon the following lemma. 


Lemma 4.1. Let p(A) be a function of bounded variation on every finite in- 
terval and of finite norm, Ni(p)< ©. Let 2:=%x,+1y, be any complex number, 
and leta<Im 2<b be any strip whose distance from the line Im z=, ts greater 
than Ni(p). Let g(z) be a function analytic in the strip a<Im 2<b for which 
N(g) <0. Then the equation 

(7) This result and its proof were suggested by applying certain duality considerations toa 
corresponding result on differential equations. See R, H. Cameroa, Quadratures involving trigo- 


nometric sums, Journal of Mathematics and Physics vol. 19 (1940) pp. 161-166. This result is 
however certainly not a direct dual since g(z) need not have any Fourier transform. 


| 
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(4.4) — afte) = + 


has one and only one solution f(z) of finite norm and analytic in the strip 
a<Imz<b. In addition 


N(g; 
ly — — 
We shall prove Lemma 4.1 by the method of successive approximations. 
Let 


(4.5) s a<y<b. 


fo(z) = g(2)/(s — 21) 
and 


1 
(4.6) fala) = f 


Z— 2 
each f,(z) existing in view of Lemma 2.1. We shall show that 
(4.7) He) = fale) 
n=0 


also exists and satisfies the conclusions of Lemma 4.1. First, by Lemma 2.1, 


1 
N(fo; y) —— Mg: ») 
ly — 


1 
ly— ly — 


a<g<b,n=1,2,---. 


Thus 
N(g; 9) 
|" ly — — 
By Lemmas 2.4, 2.2, and equation (4.8) we see that the functior defined by 
(4.7) is analytic and of finite norm in the strip a<Im z<6 and that (4.5) 


holds. 
To see that the f defined by (4.7) satisfies equation (4.4) we note by (4.6) 


that 


1 
(4.9) f(s) = folz) + fa(z) = file |. 
n=l —o 


(4.8) 


n=0 


By Lemma 2.4, we can interchange the order of summation and integration 
and thus 
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(4.10) + = |. 


Z— 21 
that is, f(z) satisfies equation (4.4). 


For uniqueness let us assume that f*(z) is a solution of (4.4), analytic in 
a<Imz<0 and of finite norm. Then 


1 
fle) — f(@) = f -» - - 


Z— 2) 


1 


a<y<b. 


Since the second factor is positive, f* —f=0. 

This concludes the proof of Lemma 4.1. 

We now proceed to the proof of Theorem 2a itself. Let us therefore return 
to the notation of that theorem and assume that the hypotheses of the theo- 
rem are satisfied. We begin by introducing a transformation which transforms 
equation (4.3) into one to which the preceding lemma applies. This trans- 
formation will be based upon the following function: 


(4.11) R(w) = exp dp(r) +f |, a<Rew< 


the 7 being chosen positive and so small that 

(4.12) Ni(p:) < min {| yo — | — 5] }, 

where 

p(n) — p(0 + 90), A}, 
— — 0), —nSrA <0, 
1) — p(0 — 90), 


(4.13) = 


This is possible since by hypothesis yo lies outside the closed interval a Sy Sb 
so the right member of (4.12) is positive, and the variation of p:(A) can be 
made arbitrarily small due to the continuity of p:(A) at the origin. Then if 


0, 
50) <0, 


17 
and 
or 
Ni(p) 
ly — 
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and(8) 29 =xo+iyo= p(0+0) —p(0—0), we have 


ew 
(4.14) R(w) = exp { f elem) — pi(d) . 


We next show that there exist functions r(A) and r*(A) such that 


(4.15) 
and 
(4.16) 


and 


(4.17) = f bro — t) — r(— |dr*(2). 


For let go(A) =7(A) and 


(4.18) 


and 
(4.19) gn(A) = f — — m= 


Since go and q; are normalized it follows by the corollary to Lemma 2.6 that 
all the g, are normalized. 


Now 
= n=0,1,2,---. 


Next define 


4 
and 

1 n 
(4.21) r*(d) = gn(d). 


n=0 n! 


(*) Note that this is consistent with the definition of yo given in the statement of Theo- 
rem 2a. 
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By inspection go(A) is of finite norm, and the same is true of g:(A), when we 
note that the function with respect to which we integrate is constant in a 
neighborhood of the origin. By equation (2.18) it follows that each q, is of 
finite norm with Ni(q,) S$ [N:(q:) |", and by Lemma 2.7 that r(A) and r*(A) 
defined in (4.20) and (4.21) exist and are of finite norm. Then 


R(w) = exp { f alo) — 


exp { 


We shall show that the integration and summation can be interchanged, thus 
giving 
1 
R(w) -f oval = f edr(h). 
n! 


n=0 


The interchange is justified by the following argument: 


1 M 


n=Q ond 


M M 
e’Mr(M) — M) — wf r(A)e"drA = edr(d). 


—M —M 


By the Weierstrass M-test and the convergence of 
N 
n=l 


it follows that the above series in the first member converges uniformly for 
all real M and thus as M—~ the limit of the sum is the sum of the limits. 
Thus (4.15) holds; and it is obvious that (4.16) can be proved in the same 
way. Moreover, (4.15) and (4.16) imply (4.17). Returning now to (4.14), we 
differentiate and obtain 


(4.22) dR(w)/dw = R(w)[P(w) — Pi(w) — 20] 


where 


— 
| 
i 
| 
i 
i 
] | 
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Rewriting (4.22) by replacing each side by the function of which it is a 
Fourier-Stieltjes transform, and noting that 


dR(w) a 
fe dy nér(n) 


we obtain 

Now consider the functional equation 

(4.24) (2 fa) = — + 


where 


(4.25) ts) = — 


It is clear by (4.12) that the functional equation (4.24) satisfies the hypothesis 
of Lemma 4.1, and thus has one and only one analytic solution f;(z) of finite 
norm in a<Im z<b. We can therefore consider that f;(z) is completely defined 
by (4.24), and in terms of it we define 


(4.26) f(z) = f nus — é)dr(t), a<Imz <b. 


We shall show that this f(z) is the unique analytic solution of finite norm of 
the original equation (4.3). By Lemmas 2.3 and 2.1, f(z) so defined is analytic 
and of finite norm. Also by (4.24) we have 


fie Zo) frlz - t)dr(t) 
= fan é)dpi(d) + f — é)dr(?). 


Transposing, simplifying, and using (4.25) and (4.17), we find 
fils — ddr(t) = fe — ttdr(t) + sof — é)dr(i) 


By (4.26) and (4.23), this yields 
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ji | 16 nar) | dp(r) + g(2) 


-f “fle — »)dp(d) + 


Thus f(z), defined in (4.26), satisfies the original equation (4.3). 
Similarly if any f(z) satisfies (4.3), f(z) assumed analytic and of finite 
norm in a<Im z<8, it is seen that the function 


satisfies equation (4.24). Thus the uniqueness of the solution of (4.24) implies 
the uniqueness of the solution of: (4.3). 

This concludes the proof of Theorem 2a. 

Theorem 2 follows from Theorem 2a and Lemmas 1.1 and 1.2. 

5. The difference equation with linear coefficients. 


THEOREM 3a. Let p(A) and g(A) be sateen functions of bounded 
variation on every finite interval such thai 


(5.1) Ni(p) < ©, Nilg) < singular part of p = 0. 
Moreover let the analytic function 


(5.2) P(w) = f “e~dp(n) 


be non-vanishing in the strip a<Re w<B. Define(*) 
ytTi 
3 = lim — |. 


Let a<Im 2<b be any sirtp whose closure does not contain the line Im z= yp. 
Let g(z) be a function analytic in the strip a<Im 2<b and of finite norm, 


(5.4) N(g) < @. 
Then the equation 


has one and only one analytic solution f(z) of finite norm in the strip a<Im z<b. 
(*) This limit exists and is independent of y for a<7<8. 


* 
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For the proof we again use Pitt’s theorem (loc. cit.) which yields the result 
that 1/P(w) is expressible in the form (3.5) with Ni(p*) < . 
We next convolve equation (5.5) with p*(u) and obtain 


or 
(5.6) sf(s) — f f(z — d)dq(r) = &(2) 
where 


5.7) = f "ge — w)dp*(u). 


We note by Lemmas 2.3, 2.6, and 2.1 that g(z) is analytic in a<Im 2<b and 
that 


(5.8) 


Moreover(!°) 
(5.9) yo = Im [7(0 + 0) — G(0 — 0)]. 


Now we apply Theorem 2a to (5.6). We note that the hypotheses of that theo- 
rem are satisfied with p and g replaced by @, g. Thus there is a unique analytic 
solution f(z) of (5.6) of finite norm. Hence if (5.5) has an analytic solution of 
finite norm it is unique. But the solution of (5.6) obviously satisfies (5.5) as 
we see by convo!ving equation (5.6) with ». This yields the theorem. 

As in the earlier cases it is easily seen that Theorem 3 follows from Theo- 


rem 3a. 


(©) That (5.9) agrees with (5.3) is easily seen. 
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A GENERALIZED THEORY OF DYNAMICAL 
TRAJECTORIES 


BY 
EDWARD KASNER AND JOHN DeCICCO 


1. Introduction. The differential geometry of the dynamical trajectories 
of positional fields of force has been developed in the Princeton Colloquium(’). 
The fields of force, considered heretofore, depend only upon the position of 
the point. We shall call a field of force generalized if it depends not only upon 
the position of the point but also upon the direction through the point. In 
this paper, it is our purpose to begin the development of the geometry of the 
dynamical trajectories of such generalized fields of force in the plane. Posi- 
tional fields may be described as isotropic and generalized fields as anisotropic. 

In a generalized field of force, there are ©* dynamical trajectories. These 
are represented by a differential equation of the form 


(G) = G(x, y, + A(x, 


Conversely, any system of curves of the type (G) may represent the dynami- 
cal trajectories of /‘) generalized fields of force(*). We therefore find it ap- 
propriate to term any system of curves of the type (G) as generalized 
dynamical trajectories. 

Of course, not every such system of curves represents the trajectories of 
a positional field of force. Kasner proved that the family of ordinary dynami- 
cal trajectories is characterized by a set of five independent geometric prop- 
erties. Of these, the Property I is equivalent analytically to stating that a 
system of ordinary dynamical trajectories must be represented by a differ- 
ential equation of the type (G). The other four geometric properties specialize 
the functions G and H. 

The Property I is described as follows. For each of the ! curves of the «? 
trajectories which pass through a given lineal element Z, construct the oscu- 
lating parabola at E. The Property I is given by any one of the following 
three equivalent statements(?). 


Presented to the Society, October 31, 1942; received by the editors August 28, 1942. 

(*) Kasner, Differential-geometric aspects of dynamics, Amer. Math. Soc. Colloquium Pub- 
lications, vol. 3, 1913, 1934 (referred to as Princeton Colloquium). Also see a series of papers in 
Trans. Amer. Math. Soc. vols. 7-11 (1906-1910). 

(?) The symbol /@) means that the force vector contains an arbitrary function of two 
variables. See Kasner, A notation for infinite manifolds, Amer. Math. Monthly vol. 49 (1942) 
pp. 243-244. 

(*) Recently Terracini has given an alternate projective characterization of Property I by 
conic sections. See Sobre la ecuacion diferencial y’'’ =G(x, y, y’)y"+H(x, y, y’)y’", Revista 
de Matematicas vol. 2 (1941) pp. 245-329. 
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(I,) The locus of the foci is a circle through the point of £. 

(Ip) The directrices form a pencil of lines. 

(I,) The envelope of the «! parabolas is a straight line. 

Next we shall compare the «' lines of force and the ©? rest trajectories. 
Let a particle which is initially at rest start from a point O. The rest trajec- 
tory will then be tangent to the line of force at O. We study the ratio p of 
the departure of the rest trajectory to that of the line of force at O. In the 
positional case, Kasner showed that p=1/3, or, if the order of contact of 
each of the curves with their common tangent line is m, p=1/(2n+1). We 
obtain a converse theorem determining all generalized fields of force for which 
this ratio assumes the above values. (See formulas (35) and (45).) 

Finally we shall define for our generalized fields of force other systems of 
curves, namely, velocity systems, systems S;, and pressure curves. Each of 
these systems is of the type (G) and hence possesses the Property I. While 
the systems S;, and the pressure curves may be represented by a differential 
equation of the type (G), it is found that velocity systems must be curvature 
trajectories(*). 

2. The differential equation of the generalized dynamical trajectories. 
Let @ and w be the rectangular components parallel to the x- and y-axes of 
the generalized force vector acting at the lineal element (x, y, y’)(5). We as- 
sume that our force vector (¢, y) does not identically agree with the direction 
of the corresponding lineal element E£. For, if Y—y’¢@=0, the trajectories are 


straight lines, and are, therefore, of no interest. The equations of motion of 
a particle of unit mass are 


(1) d*x/dt® = $(x,y, y'), = y, 


The particle may be started from any position (xo, yo) with any velocity 
(dxo/dto, dyo/dto). A definite trajectory is described. Since the same curve may 
be obtained by starting from any one of its ©! points, the total number of 
trajectories, for all initial conditions, is ©*. 

We now proceed to eliminate the time ¢ from the equations (1). (For the 
positional case, see the Princeton Colloquium, pp. 7-9.) The elimination can 
be performed directly from (1), but we prefer to do it in the following manner. 
Let N and T be the normal and tangential components along a trajectory. 
Then 

N = — + T= + ; 


2 
p= (N+ yT)/(1 + 


(*) Kasner, Dynamical trajectories and curvature trajectories, Bull. Amer. Math. Soc. vol. 40 


(1934) pp. 449-455. 

(5) The functions ¢ and y could have been assumed to involve x and y and also te be 
homogeneous of degree zero in dx/dt and dy/dt. Then since during motion dx/dt and dy/dt are 
not both zero, the functions ¢ and y afe seen to depend on (x, y, y’) only. 
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By these equations, it is observed that either ¢ and y, or N#0 and T, are 
two arbitrary functions of (x, y, -y’). 
If v denotes the speed of the particle and r the radius of curvature, we find 


(3) ov = rN, vv, = T. 
Eliminating the speed v from these equations, we find the intrinsic differential 
equation of the generalized dynamical trajectories to be 
(4) d(rN)/ds = 2T. 

Upon putting this equation into the cartesian form, we find the following 
result. 

THEOREM 1. The ~* dynamical trajectories of a generalized field of force are 
given by the differential equation 
(S) = — 2) — + We — — 36] 9. 
This is of type (G). 


It is noted that this represents the ordinary dynamical type if ¢ and y 
are independent of y’. 

3. The type (G) always represents the trajectories of a generalized field 
of force. In order to simplify our discussion for generalized fields of force, 
it is convenient to introduce the functions a and £8 defined as follows: 


et = (1+ B = — 2T7/(1+ y)N; 
We shall also write the equations connecting (¢, ¥) and (a, 8) as follows: 
= (1+ — 99), B= —2¢+ y'¥), 
2(1 2(1 + 


(6) 


Now placing the last two of equations (6) into the differential equation (5), 
we see that the differential equation of the generalized dynamical trajectories 
is 


(8) = (as + yay) y” + (ay + 


THEOREM 2. Any system of curves of the type (G) represents the dynamical 
trajectories of ~/) generalized fields of force. 

We proceed to the proof of our theorem. Let a system of curves of the 
type (G) be given. If there exists a generalized field of force of which the 
given curves are the trajectories, the functions a and 8 must be determined 
so that 
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(9) a:tya,=G, 


By the theory of differential equations, we know that the first equation 
determines a@ as an expression of the form 


(10) a = a;(x, y, + arly — xy’, y’), 


where a is a definite function of three variables and a; is an arbitrary function 
of two variables. The second equation then completely determines 8. This 


completes the proof of our Theorem 2. 
For example, all generalized fields of force whose dynamical trajectories 


are the vertical parabolas y =ax*+bx+c, are determined by 
(11) a = F(y — xy’, y’/), B= «Fy — Fy. 


4. Fields of force depending only upon differential elements of order 
n for n=2. The argument presented in the preceding section may be general- 
ized to prove the following result. 


THEOREM 3. Any system of «*"+! curves for n22 may represent the dynami- 
cal trajectories of 1/("+®) more extensive fields of force depending only upon dif- 
ferential elements of order n. 


The equations of motion of a particle of unit mass for a field of force which 
depends only upon the position of a differential element of order n, are 


(12) d*x/di® = o(x, y, dy/d?® = (x, y, y™). 


Now if N and T represent the normal and tangential components of the 
force vector along any trajectory, it is found by equations analogous to (2) 
that N and T may be considered to be any two arbitrary functions of 
(x, y, y’, > ++, y™). Then since »22, the functions N* and 7* defined by 


(13) T* = 2T(1 + 


are also arbitrary functions of (x, y, y’, - - 

The intrinsic differential equation of the trajectories of such an extensive 
field of force is given by an equation analogous to (4). Therefore replacing 
rN and 2T by the functions N* and T*(1+~’?)—? and performing the differ- 
entiation, the differential equation of the dynamical trajectories is 


(14) y+) = (1/N$m)[7" — (NE + Ny + + OM 


Therefore there are ©**+! dynamical trajectories in an extensive field of force 

depending only upon the position of a differential element of order »22. 
The right-hand side of (14) may represent any arbitrary function of 

(x, y, y’, -- +, y™). Also the choice of T* is uniquely determined as soon as 
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N* is given. Therefore there are ©/(+® extensive fields of force whose trajec- 
tories are a given family of **! curves. 

5. The geometric Property I. Before discussing the Property I, it is ad- 
visable to collect here the various formulas for the osculating parabola of a 
curve(*). The equation in running coordinates (X, Y) of the osculating parab- 
ola at any differential element of the third order (x, y, y’, y’’, y’’’) of the 


curve y=~¥(x) is 
— — x) — — 
(15) 
+ 18y"*[y/(X — x) — (Y — y)] =0. 


The focus is 


3y"(1 + iy’)? 
+ iy’) 3iy’’?] 


(16) X+i¥ =x+iy+ 


The equation of the directrix is 
(17) 2y/"(X — x) + — — y) = 3y"(1 + 


Consider now any system of «* curves. For each of the ~! curves of this 
system which pass through a given lineal element £, construct the osculating 
parabola at E. This gives rise to ©! parabolas all containing the lineal ele- 
ment E. 


THEOREM 4. A system of ~* curves is a generalized dynamical system if and 
only if the locus of the foci of the ~' osculating parabolas, constructed at any 
lineal element E, is a circle passing through the point of E. 


The equation of this focal circle is 
2G[(X — + (¥ — y)*] — + + 311 — — 2) 
+ [(1+ — — y) =0. 


THEOREM 5. A system of ~* curves is a generalized dynamical system if and 
only if the directrices of the ~' osculating parabolas, constructed at any lineal 
element E, form a pencil of lines. 


(18) 


The vertex D of this pencil of lines is 
(1+ 
2G 


Consider now any system of «* curves. At a fixed lineal element E, y’’’ is 
a function of y’’ only. Now let w represent the expression y’’’/y’’? so that at E, 


(19) D: X+i¥Y=x+iy+ [— (y'H — 3) + iH]. 


(*) For an analogous discussion in minimal coordinates, see Kasner and DeCicco, Families 
of curves conformally equivalent to circles, Trans. Amer. Math. Soc. vol. 49 (1941) pp. 378-391. 
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w is a function of y’’ only. By (15) the ©! osculating parabolas at E are repre- 
sented by the equation 


(20) y"[(y’w — 3)(X — x) — — y)]? + 18[y(X — x) — (Y — y)] = 0. 


Now if w,-=0, our family of parabolas is linear. The foci will all lie on 
the straight line into which the focal circle (18) degenerates by letting G ap- 
proach zero. The directrices will all be parallel, their directions being per- 
pendicular to the direction so that the line of E bisects the angle at the 
point O of E formed by this direction and the focal straight line. The complete 
envelope of this linear family of parabolas consists of the line of E and the 
line at infinity. 

Any system of «* curves such that the «! osculating parabolas, con- 
structed at E, form a linear family must be of the special subclass of type (G) 


(21) = A(x, 


Let now w,#0. Thence our family of ©! parabolas (20) possesses an 
envelope (in addition to the common tangent line of the element EZ). This en- 
velope is given parametrically by the equations 


3(w + 
» 


(22) 
Now differentiating these partially with respect to y’’, we find 


3 
xy = + wy y)(w + 


(23) 


“9 


Our envelope is obviously a point if and only if 3wy: + 2y’"wyy- =0. There- 
fore the family of ©! osculating parabolas, constructed at the element E, pass 
through a fixed point if and only if the differential equation of the system of 


oo curves is of the form 


(24) = p(x, y, + v(x, 9, 


Now we exclude this situation. Then the slope dY/dX of our envelope 
(21) is given by the equation 


(25) dY/dX — = —3/Wwt - 


This formula demonstrates that our envelope is a straight line if and only 
if y’'‘w=G+Hy’’. Therefore we may state the following result. 


THEOREM 6. A system of ~* curves is a generalized dynamical system if and 
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only if the envelope of the ~' osculating parabolas, constructed at any lineal 
element E, is a straight line (not passing through the point of E). 


This conimon tangent line y of the ©! osculating parabolas is given by 
the equation 


(26) 2G(y’'H — 3)(X — x) — 21GH(Y — y) +9=0. 


Thus our ©! parabolas are tangent to the line y, the line of the element E 
at the point of EZ, and the line at infinity. 
Our line y intersects the line of the lineal element EZ in the point 


(27) P: X+i¥ = x + iy + (3/2G)(1 + iy’). 


Moreover we find that our line y is tangent to the focal circle at this point P. 
The line DP connecting the vertex D of the directrices and this point P, is 
perpendicular to the line of E. 

We remark that the one-parameter set of parabolas for a given element E, 
is in general quadratic. 

6. The angular rate \ and the terminal curve C. For the further develop- 
ment of our theory, it is found essential in a given generalized field of force 
to associate a number AJ to any lineal element E. As the element E rotates 
about its point O, the corresponding force vector F also rotates about O. We 
define the angular rate \ as the instantaneous rate of change of the inclination 
of F with respect to the inclination of EZ. It is given by the formula 

1+ 

?+y 

Also as E rotates about its point O, the end point of the corresponding 
force vector F describes a curve C. We shall call this the terminal curve C 
corresponding to the point O. The terminal curve C degenerates into a fixed 
point if and only if our field of force is ordinary positional. 

Obviously for an ordinary positional field of force the angular rate \=0. 
However there are also certain generalized fields of force for which the angular 
rate \=0. These fields are such that the terminal curve corresponding to any 
point O is a straight line through O. 

7. The lines of force and the rest trajectories. In this section, we wish to 
define two special important families of curves in a given generalized field of 
force, namely, the lines of force and the rest trajectories. Let us first consider 
the lines of force. In general, at a given point O, there is one lineal element Eo 
such that the direction of the force vector Fo is identical with that of Eo. 
Let us also suppose that the angular rate \ at such an element Zp is not unity. 
If in a certain region of the plane, there are ©? such lineal elements Eo, we 
define the integral curves of this set of ©? elements Ey as the lines of force 
of our generalized field of force. 


(28) voy’). 


> 
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The lines of force are given by the differential equation of the first order 
(29) y’) y'o(x, y’) = 0. 


The condition that the angular rate \¥1 is equivalent to stating that there 
exists a function y’=f(x, y) which satisfies this equation identically. Thus in 
general, there are ~' lines of force. 

A rest trajectory is defined as the path described in a generalized field of 
force by a particle which starts at rest from a given point O. From the equa- 
tions 


(30) dy/dt = y' dx/dt; d*?y/dt? = y''(dx/dt)? + y’ d*x/dt*; 


together with the initial conditions dxo/dtp=0, dyo/dto=0, it is immediately 
deduced that a rest trajectory is always initially tangent to the line of force 
through the given point O. Since on any rest trajectory, there is, in general, 
only one such point O where the particle is at rest, it is deduced that there 
are ©? rest trajectories in a given generalized field of force(’). 

8. The generalization of the theorem concerning one-third of the curva- 
tures.(*) We shall now study in a given generalized field of force the ratio p 
of the departure of the rest trajectory to that of the line of force at a given 
point O. For the ordinary case Kasner showed that this ratio p= 1/3. We shall 
find, however, a more general formula for p in terms of the angular rate X. 

At the point O, let Eo be the lineal element of the line of force. Then the 
rest trajectory described by a particle initially at rest at O will be tangent to 
the line of Eo. 

Now we shall assume that the curvature of the rest trajectory is not zero 
at the point O. Let us calculate the value of the second derivative y’’ of the 
rest trajectory at O. Since our rest trajectory is tangent to the line of force at O, 
the first derivative y’ must satisfy the equation (29). Therefore from this and 
the equation (5), it results that the second derivative y’’ of the rest trajectory 
at O is given by the formula 


(31) [ve + yy — — + — — 3] y” = 0. 


(7) Of course, we are assuming that in the region of the opulence (the totality of ©? lineal 
elements of the plane) where the generalized field of force is defined, there are points O (even in 
the imaginary domain) through which there pass such special lineal elements EZ» (one or more). 
Otherwise there are no theories of lines of force and rest trajectories. 

(*) For a generalization to acceleration fields of higher order, see Kasner and Mittleman, 
A general theorem on the initial curvature of dynamical trajectories, Proc. Nat. Acad, Sci. U.S.A. 
vol. 28 (1942) pp. 48-52; see also Extended theorems in dynamics, Science vol. 95 (1942) pp. 249- 
250. 

Also see Fialkow, Initial motion in fields of force, Trans. Amer. Math. Soc. vol. 40 (1936) 
pp. 495-501; and Kasner and Fialkow, Trans. Amer. Math. Soc. vol. 41 (1937). 
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The second derivative Y’’ of the line of force is calculated by taking the 
total derivative of the equation (29). Therefore this is given by 


(32) [ve + — 2) — + — — = 0. 
Subtracting these two equations, we find 
(33) [vy — Voy — 3] y” = — Voy — 


Since the magnitude of the force by assumption is not zero and since at Eo, 
v=y’d, we see that ¢~0. Upon dividing this equation by ¢ and making use 
of the equation (28) defining our angular rate \, we see that the preceding 
equation may be written in the form 


(34) (3 — = (1 — 


The angular rate \+1. For, otherwise, the line of force could not exist. 
Also if the angular rate \+3, the curvature of the line of force is not zero. 
The order of contact with the common tangent line of the line of force will 
be two or more if \=3. Thus we may state the following result. 


THEOREM 7. The ratio p of the curvatures at Ey of the rest trajectory and the 
line of force is given by the formula 


(35) 


For an ordinary field of force, the angular rate \=0 so that p=1/3. There 
are generalized fields of force for which the ratio p=1/3. We shall determine 
these in a later section. 

9. The generalization of the theorem concerning 1 /On+1) of the rates 
of departure. In this section, we shall generalize our Theorem 7 to the case 
where the rest trajectory has contact of order #22 with the line of the lineal 
element Ep at the point O of Eo. For this purpose, it is found advisable to 
take the element E> in the normal form, that is, its point O at the origin and 
its direction that of the x-axis. Therefore our rest trajectory may be written 
in the form 


(n+ 1)! (m + 2)! 


The line of force has at least first order contact with the line of Eo. Let m 
denote its order of contact with this line. Therefore our line of force may be 
written as 


Cm+2 


(37) 


{ 
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The equation (36) must satisfy the differential equation (5). In order to 
perform this substitution, we need to know the expansion of the left-hand 
side of (29) when (36) is placed into it. For that purpose, it is necessary to 
know the total derivative of order p of the left-hand side of (29). 

To solve this problem, it is convenient to introduce the following notation. 
If x is any function of (x, y, y’), then 


(38) = (= +y’ = (2 +7’ 
Ox oy Ox oy 


With this notation, we see that the first p derivatives of the left-hand side 
of (29) are 


— = — y'o™ + — — 

a 


(39) — yo) — + + — 9), 


d 


+ owe + yoy ¢), 


where y, y’, only. This last formula may easily be 
proved by induction. 

If the line of force has contact of order m with the line of Eo, not neces- 
sarily in the normal position, then from the preceding equations, it follows 
that 


yo == yo = == meee = = 0, 
yn) — + — yoy — ¢) =0. 

THEOREM 8. The order of contact m of the line of force with the line of Eo 
is equal to or greater than one less the order of contact n of the rest trajectory. 
That is, m=n—1. 

Let us assume that m<n. Upon substituting the rest trajectory (36) into 


the expression y —y’¢, we find that the power series expansion of this is (since 
m <n) 


(40) 


ede 


(41) 


m! 


| 
| 
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Placing this and also the series expansion (36) of the rest trajectory into 
the differential equation (5), we find that the power of the term of lowest de- 
gree is m-+-n—2 (since the condition m <n demonstrates that the term of de- 
gree 2n--2 is higher). Hence upon placing the coefficient of x"*"-* equal to 
zero, we find ‘ 


— — 1 — m) 
m\(n — 1)! 


Since ‘the angular rate \+1, it follows from this equation that if m<n, 
then m=n—1. Hence in all possibilities m 2n—1. Therefore the order of con- 
tact of the line of force with the common tangent line can never be less than 
the order of contact of the rest trajectory decreased by one. 

Now let p be the ratio of instantaneous departures of the rest trajectory 
and the line of force from their commen tangent line. For the case m=n—1, 
obviously p=0. Consider next the case where m=n. Of course Ca41%0 and 
Cu41%0 for m=n, but C.41~0 and C,,:=0 for m>n. The power series ex- 
pansion of ¥ —y’@ when the rest trajectory is placed therein, is 


— G0] — Cass] 


0. 


(42) 


(43) 


Placing this and the series expansion (36) of the rest trajectory into our 
differential equation (5), and putting the coefficient of the term of lowest 
degree 2m —2 equal to zero, we discover 

Cn+1 
— 1)! 

The ratio p of departures of the rest trajectory and the line of force from 
their common tangent line is defined by the ratio p =Cn41/C,41. Therefore we 
derive the following result 


THEOREM 9. The ratio p of the departures from the line of the lineal element 
Eo of the rest trajectory and the line of force is given by the formula 


(45) p = (1 —A)/(2n+1-—A)- 


By Theorem 8, we note that p=0 if and only if m=n—1. By this formula 
p=, thatis m>n if and only if the angular rate \=2n+-1. For any other 
case the orders of contact are the same, m =n, and p is a finite nonzero num- 
ber. 

For an ordinary field of force, the angular rate \=0 so that p=1/(2n+1). 
In the next section, we shall discuss all generalized fields of force for which 
the ratio p=1/(2n+1). 

If \+#2n+-1, then the order of contact m with the common tangent line 


- 
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of the line of force is n, or n—1, where m is the order of contact of the rest 
trajectory. However, if \=2n-+-1, then the line of force may have higher order 
contact with the common tangent line. This is a new phenomenon. It can never 
arise in ordinary positional fields of force. This new situation is that the line 
of force can have the same order of ¢ontact with the common tangent line 
as that of the rest trajectory, or else one less, or higher. 

10. Generalized fields of force for which the ratio of the departures 
p=1/(2n+1). By Theorem 9, we see that if p=1/(2n+1), then the angular 
rate \=0 at the lineal element Epo. 


THEOREM 10. All generalized fields of force for which the ratio of the depar- 
tures p=1/(2n+-1) must be such that the eliminant with respect to y' of the two 
equations 


(46) vy — oy = 0, 
is identically zero. 


This theorem means geometrically that the force vector Fo of the special 
lineal element E» (whose direction coincides with that of Fo) is tangent to 
the terminal curve C of the point O at the end point of Fp. 

11. Generalized velocity systems. For our generalized fields of force, we 
define systems of curves which are generalizations of velocity systems for 
ordinary positional fields of force. (For the positional case, see the Princeton 
Colloquium, pp. 42-44.) 

Consider a particle of unit mass in any generalized positional field of 
force with rectangular components ¢ and y. The equations of motion are 
then given by (1). If the initial position and the initial velocity are given, the 
motion is determined. If only the initial position and the direction of the mo- 
tion are given, the radius of curvature r will depend for its value on the initial 
speed. Hence, in addition to the usual formula for the speed v, there must be 
a formula expressing v? in terms of x, y, y’, r. This is furnished by the first of 
equations (3). This equation may be written in the form 


(47) v= — + 


In the actual trajectory v varies from point to point. If now we replace v? 
in this result by some constant, say 1/c, the resulting equation may be written 
in the form 


(48) y” = cy — y’o)(1 + y”). 


The curves satisfying this differential equation, they are not in general trajec- 
tories, we define as generalized velocity systems. For any generalized field of 
force, a curve is a velocity curve corresponding to the speed vo, provided a 
particle starting from any lineal element of the curve with that speed de- 
scribes a trajectory osculating the curve. !n a given field of force there are ~* 
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trajectories and ~? velocity curves. If c is given, we have »? velocity curves. 
For unit velocity, we have the following result. 


THEOREM 11. Any system of ~* curves may be regarded as the totality of 
velocity curves corresponding to unit velocity in ~/) generalized fields of force. 


Upon eliminating the constant c, we obtain the complete system of ’ 
velocity curves for a given generalized field of force. The differential equation 
of all velocity curves is 


— = [ve + — — 
+ [vy — yoy — 36 + 216 + + 


Therefore a complete velocity system is of the type (G), thus possessing the 
Property I. 

Observe that the expression (W—y’¢)(1+~"") of equation (48) can repre- 
sent any function of (x, y, y’). This result shows that any complete velocity 
system is identical with curvature trajectories(*). For an arbitrary family of 
co * curves, a curve is a curvature trajectory if at any lineal element £ of this 
curve, its curvature is ¢ times the curvature of the curve of the family which 
passes through E£. For a given family of ©? curves, there are * curvature 
trajectories. These possess an additional differential property in addition to 
the Property I. 


(49) 


THEOREM 12. Any system of ~* curvature trajectories may represent the 
velocity curves of 1+) generalized fields of force. 


By equations (6), it is seen that the velocity systems are the curvature 
trajectories 


(50) yl" = (az + yay)” + ayy’. 


By this, it is seen that for a given system of * curvature trajectories, the 
function a is completely determined except for an additive constant. Since 
B is entirely arbitrary, we therefore discover the validity of our theorem. 

12. The generalized systems S, defined by P=k N. In a generalized field 
of force, we may define systems of curves S; which are generalizations of the 
corresponding systems S; for the ordinary positional case (see the Princeton 
Colloquium, pp. 91-94). 

If an arbitrary curve is drawn in a generalized field of force, and the par- 
ticle of unit mass is started along it from one of its points with a given speed, 
the constrained motion along the given curve is determined. The acceleration 
along the curve is given by 7, the tangential component of the force vector. 
So the speed at any point is determined by 


(51) = 2 f Tds. 
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The pressure P (of course normal to the curve, since the curve is considered 
smooth) is given by the formula 


(52) P=7/r—N. 


If we increase the initial speed, the effect is to increase v? by a constant c; 
and hence P changes by the addition of a term c/r. 

If the given curve is a trajectory, the initial speed may be so choser © iat 
the pressure vanishes throughout the motion; that is, trajectories may be 
defined as curves of no constraint. Of course if a different initial speed is used, 
P will be of the form c/r; but, as regards the curves, they are completely 


characterized by P=0. 
The general problem that we wish to consider is to find curves so that the 


pressure P shall be proportional to the normal component WN of the force 
vector. So P=kN. To a given value of & there correspond * such curves. 
The system so obtained will be denoted by S,. 

We note that k=0 gives the system Sp of trajectories and k= © gives 


the system S,, of velocity curves. 
To derive the differential equation of the system S;, we must eliminate 


the speed v from the equations 
(53) v? = (k + 1)rN, vw, = T. 
The result is 
(54) Nr, = nT — rN,, 
where n= 2/(k+1). 

Expanding this, we discover as the differential equation of the system S; 
— = Wet — 62) — 

+ [vv — — 36 — — YW/(1 + Jy". 


We note that »=2 gives the generalized dynamical trajectories, and »=0 
gives the velocity curves. 

THEOREM 13. Any system S;, is represented by a differential equation of the 
type (G) characterized by the geometric Property 1. Conversely, any system of ~* 
curves with the Property | can represent the system S;, with k# @ of «/‘) gen- 
eralized fields of force. 

In terms of the functions a and B defined by equations (6), our system S; 
is represented by the differential equation of the type (G) 


(56) = (az + yay)y” + (ay + 


Thence if 20, the functions a and 8 may be determined up to an arbitrary 
function of two variables. 


(55) 
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We note that this theorem is not valid for velocity curves k= © or n=0. 
In addition to Property I, the type (G) must possess the additional differ- 
ential conditions of second order 


Gy = 2H, + Hy +y'Hw, 


57 
+ ¥Gyy — Gy = Hee t+ 29 Hay + yyy. 
If these conditions are satisfied, then our differential equation of the type (G) 
represents curvature trajectories. These may therefore represent the velocity 
curves of « !+/@) generalized fields of force. im 

13. Curves of constant pressure. We now define curves of constant pres- 
sure in a generalized field of force analogous to those in an ordinary positional 
field of force. (See the Princeton Colloquium, pp. 97-98.) Consider the curves 
corresponding to P=c, where c denotes any constant. The curves obtained 
may be termed curves of constant pressure. Only along such a curve is a con- 
strained motion of a particle possible such that the pressure against the curve 
remains constant. 

For a given value of c, a system of ~* curves is obtained, whose intrinsic 
equation, found by differentiating the relation 


(58) P=7/r—N=c, 

is 

(59) (c+ N)r, = 2T — rN,. 

By this, we see that this system is given by a differential equation of the type 
(G) and therefore possesses the geometric Property I. 


THEOREM 14. For a given value of c, there are ~* pressure curves P=c 
whose differential equation is of the type (G) characterized by the geometric 
Property 1. Conversely, any system of ~* curves with the Property | may repre- 
sent the pressure curves for a preassigned c of ~/) generalized fields of force. 


This result may be deduced from the differential equation of the pressure 
curves P=c, which may be written by means of equations (6) in the form 


[1 + c(1 + 9" = [as + yay] y” 
60) 
+ [B+ ay + 3cy/(1 + 


( 


Thence for a preassigned c, a is determined up to an arbitrary function of two 
variables and f is thereby completely known. 

If all systems S, are combined, we obtain ©‘ curves. Similarly combining 
all pressure curves, we find «‘ curves. Each of these two distinct quadruply- 
infinite families of curves is given by a differential equation of the form 
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(61) ylv on Ay"? + +C, 


where A, B, C are functions of the curvature element (x, y, y’, y’’). (See the 
Princeton Colloquium, pp. 95, 98, 113-115.) 

The system (61) may be characterized geometrically as follows. If for 
each of the ©! curves of this family which pass through a given curvature 
element (x, y, y’, y’’), we construct the osculating conic (five-point contact), 
the locus of the centers of these conics is a conic passing through the given 
point in the given direction. 

Such quadruply-infinite systems of curves (61) were first encountered in 
connection with the systems of extremals in a calculus of variation problem(*). 
These differential equations also arise in connection with the study of special 
types of fields of force which depend upon the position of the point and the 
time only, and in many other connections(?®). 

In conclusion, we note that all our results are valid under the total pro- 
jective group consisting of collineations and correlations. The only contact 
transformations of the plane which preserve differential equations of the type 


(G) are the projective ones. 


(*) Kasner, Systems of extremals in the calculus of variations, Bull. Amer. Math. Soc. vol. 13 


(1907) p. 290. 
(#9) See Kasner, Revista de Matematicas vol. 3 (1942) pp. 7-12. 
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LEGENDRE POLYNOMIALS 


BY 
A. ZYGMUND 


§1 
1. Let m and be any two positive integers such that 
(1.1) m/n = 1+ 4, 


where 6 is positive. It has been proved that for any trigonometric poly- 
nomial 


(1.2) T(0) = (1/2)ao + cos v6 + b, sin v@) 


of order n, 


2 
(1.3) max |7(0)}$ Ai max 
0<0<2r 2m +1 
where the coefficient Aj depends on 6 only('). 
Similarly, for r21 


1 i/r 1 2m r\ l/r 
1.4 T(0) |" d6> Bi ——— T {| 


with Bj depending on 6 only. 
It is the main purpose of this note to show that there exist analogous re- 


sults for ordinary polynomials 
(1.5) Q(x) = cop + 


if instead of the points 27u/(2m+1) we consider the zeros of the Legendre 
polynomial P,,(x), where m satisfies the inequality (1.1). These zeros will be 


denoted by (?) 


Presented to the Society, December 27, 1942; received by the editors December 1, 1942. 

(4) See S. Bernstein, Sur une classe de formules d’interpolation, Bull. Acad. Sci. URSS. (7) 
vol. 4 (1931) pp. 1151-1161. A proof of (1.3) (as well as that of inequality (1.4) below) was also 
given by J. Marcinkiewicz and A. Zygmund, Mean values of trigonometric polynomials, Fund. 
Math. vol. 28 (1937) pp. 131-166, especially p. 148. The authors overlooked the earlier paper 
of Bernstein. 

(?) Of course, the inequalities (1.3) and (1.4) for trigonometric polynomials may be easily 
translated into analogous results for ordinary polynomials, if we take for the xy” the zeros of 
Tchebyscheff’s polynomials. The case of Legendre abscissas requires, however, an entirely dif- 
ferent argument. 
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(m) (m) (m) 


By 7, vS”, - - - , y¥@ we shall denote the Christoffel numbers, that is the 


numbers such that 


(m) 
(1.7) R(x)dx = DO R(x, 
for any polynomial R(x) of degree not exceeding 2m —1. It is very well known 
that these numbers are positive(*). 
THEOREM 1. Suppose that m and n are positive integers, n<m, and that r is 
any real number not less than 1. Then for any polynomial Q(x) of degree n, 


(1.8) max |Q(x)| S$ Km,» max: 


where Kn ,, depends on m and n only. If m and n satisfy (1.1) then 
(1.10) Ku s A; 


with A, depending only on 5. 


2. The proof of the inequalities (1.8) and (1.9) has certain features in 
common with that of (1.3) and (1.4). In the latter case, the main idea of the 
proof was that to a partial sum of a trigonometric Fourier series we may add 
a nonoverlapping group of terms so that the resulting expression is a simple 
linear combination of the first arithmetic means of the Fourier series. In the 
proof of (1.8) and (1.9) we shall apply a similar device to Fourier-Legendre 
series, but instead of the first we shall consider the second arithmetic means 
of the series. 

Let us consider an arbitrary series uo+u,;+ --+ +t%,+ -+-. The partial 
sums and the second Cesaro means of the series we shall denote by s, and a,!’, 
respectively. It is easy to see that, given three non-negative integers n <p <q, 
we can always find constants A, B, C (depending on n, p, g only) such that 


(2.1) Aon’ + Boy’ + — 


contains only terms u, with y >n-+-2. 
For the proof of this remark, we observe that 


(?) See, for example, G. Szegé, Orthogonal polynomials, Amer. Math. Soc. Colloquium 
Publications vol. 23 (1939) p. 46 ff. 
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v= 


and that similar formulas hold for o} and o;’. This gives for the determina- 
tion of the numbers A, B, C the equations 


A+B+C=$1, 


1 1 1 1 1 1 
A B Cc = 0, 
A B C 
+ + =0 
(n+ 1)\(n+2) (p+ 1)(p + 2) (q + 1)(q + 2) 


Hence 
(nm + 1)(m + 2) ba (q + 1)(q + 2) _ _ &+ IP + 2) 
(p — n)(q — n) (q— p)(q— (q — — n) 


The next remark we need is the following theorem of Fejér(*): The second 
Cesaro means of the series 


(‘). 


(2.2) A= 


(2.3) 
v= 0 


are non-negative for —iSxS+1, —1S¢S+1. 
Let us denote these means by K,!’ (x, #) and let a,’’ (x; f) denote the second 
Cesaro means of the Fourier-Legendre series 


(2.4) 4,P,(x) 
of a function f(x) defined in the interval —1sx* +1. Then 
+1 
on (x; f) = (x, dt. 
-1 


Hence 


+1 
max | (x; f) | max | | . Kii' (x, ddt 


(2.5) —1S2S+1 -1 


= max | f(z)|. 
—1s7s+1 


(*) Thus the numbers A, —B, C are positive. 
(*) L. Fejér, Uber die Laplacesche Reihe, Math. Ann. vol. 67 (1909) pp. 76-109. 
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Again, if r21 and 1/r+1/r’=1, then 


+1 


+1 l/r +1 
< {f | f(t) |" Ky (x, { Ky. (x, nat 


so that 


” +1 r 
(2.6) | on (x; f) | sf | f(t) | "Kx (x, édt. 


Integrating both sides of the inequality over the interval —1Sx*<+1,- 
and inverting the order of integration on the right, we get 


+1 +1 
(2.7) | on (x; f) dx sf | | "db. 


3. We shall now prove the following theorem which is in a sense a con- 
verse of the inequality (1.9), and which gives slightly more than we actually 
need for the proof of Theorem 1. 


LemMa. Let r= 1 and let m and n be two positive integers such that nS2m—1. 
Then, for any polynomial R(x) of degree n, 


= ir (m ur +1 I/r 


1 


where the coefficient Lm, depends on m and n only. If 
(3.2) n(1 +6) S 2m —1, 
then 

(3.3) L(1 + 1/8*), 
where L is an absolute constant. 


Proof. Let u be any integer such that n—2<y<2m-—1, and let A*, B*, C* 
denote the numbers defined by the formulas (2.2) with n, p, g replaced by 
n—2, u, 2m—1, respectively(*). Hence, treating R(x) as a Fourier-Legendre 
series, we get 
(3.4) R(x) = A*ono(x; R) + Bto,’(x; R) + C*orm—s(x; R). 

Now let A be any positive integer not exceeding 2m—1, and let us con- 
sider the inequality (2.6) with f replaced by R and m by A. We get 


(*) The argument is valid even if n=0 or n=1, provided we set oan =o.,=0. 
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m +1 m 
(m) (m) r (m) 
-1 


+1 
= f | R(2) | dt, 


+1 
(xe, = | Kil! (x, = 


v=l -1 


Hence, on account of (3.4) and using Minkowski’s inequality, 


l/r 
l/r 


r (m) aly 
+ C ; R) 


< (A* +| B*| 


This proves (3.1) with 


(3.6) Lun = A* +| B*| +C*. 


In order to prove (3.3) we assume that we have equality in (3.2), for this 
only increases 5, and we set 


w= — 2) + (2m — 1))/2] =m —1+ 1/2]. 
Then 
2m —1—(n— 2) >2m—1—n= ni, 
2m —1—p=m— [(n — 1)/2] = m — (1/2)(n — 1) 
> (1/2)(2m — 1 — n) = (1/2)né, 
(n — 2) = m+ [(nm — 1)/2] — (w — 1) m— (1/2)(m — 1) - 1 
= (1/2)(2m — 1 — = (1/2)nd. 
Hence the denominators in (2.2), with n, p, g replaced by »—2, yu, 2m—1, re- 
spectively, all exceed (1/4)m?6?=(1/4)(2m—1)*6?(1+6)-*. The numerators 


are O(m?). From this and (3.6) we deduce (3.3). 
4. We now pass to the proof of (1.9). We note that 


(4.1) {f = max ff 
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for all f with 


+1 1/r’ 
(4.2) | <1. 


For f we may take continuous functions only, or even only polynomials. 
We now observe that, if s,(x; f) denotes the mth partial sum of the Fourier- 


Legendre series of f(x), then 


+1 +1 +1 
(4.3) O(x)f(x)dx = QO(x)sn(x; f)dx = Q(x) T(x)dx, 


where 7(x) is any polynomial such that the mth partial sum of its Fourier- 
Legendre development coincides with s,(x; f). Let us set 


(4.4) T(x) = A'on_o(x; f) + Boy (x; f) + Con(x; f), 
where p is any integer satisfying n—2<p<m, and the numbers A’, B’, C’ 


are defined by (2.2) with , p, g replaced by n—2, p, m, respectively. Since 
Q(x) T(x) is of degree n-+m S2m—1, 


= (m) (m),  (m) 
QO(x)T(x)dx E(x, )T(x, y 

m m «m))! 
{Sloe 


Applying Lemma 1 to the last sum and denoting by L’ the least upper bound 
of the sequence Lmm (L’ is finite, on account of (3.3)), we get 


m l/r’ 
{ T(x”) I" 
+1 l/r’ +1 ifr? 
s f | T(x) | s uy f | T(x) | ax} 
| 
+1 +1 iow 
f +| B’| {f | oy | 
| 
Ur’ 
+c} f | om | ax\ ] 


+1 
< L(A’ +| B’| +c} f ax\ Knn, 
-1 


(4.5) 


Ir’ 


[July 
with 
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(4.6) Kan = L'(A’ +| B’| +C’. 
From this, (4.1), (4.3), and (4.5) we deduce (1.9). It may be added that, if 
r =1, the above argument does not require Lemma 1. What it uses is the triv- 


ial inequality. 
(m) 
max |Q(x,- 
y=1,°°*,m 


)| max | Q(x)| 


(which of course is contained in (3.1) for r=+ ©). 
The proof of the inequality (1.8) is essentially the same as that of (1.9). 
Instead of (4.1) and (4.2) we use the relation 


+1 +1 
| Q(x) | = man Q(x)f(x)dx, for | f| dx 1. 


The rest of the argument is unchanged. 
5. In order to prove (1.10), we take 


= — 2) + m)/2] = + m)/2] — 1. 
Then (the argument is analogous to (3.7)) 
m—(n—2)>m—n2 ni, 
m—p=m— [(n+m)/2])+1>m— (n+m)/2 
= (1/2)(m — m) (1/2)né, 
— — 2) = 
= (1/2)(m — m) = (1/2)né, 


so that the denominators of A’, B’, C’ are not less than (1/4)m?5*=(1/4)m? 
-(1+6)-*6?, assuming, as we may, that #=mn(1+5). Since the numerators are 
O(m*), we see that, assuming (1.1) 


(5.1) Ku» K(1 +6), 
where K is an absolute constant. This concludes the proof of Theorem 1. 
§2 


6. The argument used in §1 may be applied to some other problems. As 
an example, we shall prove the following: 


THEOREM 2. Let r2=1 and let m and n be any two positive integers such that 
m=n. Then every polynomial R(x) of degree n may be represented in the form 


(6.1) R(x) = Ri(x) — R2(x), 
where R, and Rz are polynomials of degrees at most m, are non-negative for 
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—1sx5+1, and satisfy the inequalities 
+1 l/r +1 
(6.2) Ri(a)ax} < f | R(x) | 


1 
with Km.» depending on m and n only. If m and n satisfy (1.1), then 
(6.3) K(1 + 
where K is an absolute constant. 

Proof. Let us set 
R(x) = Rt(x) — R(x), 


where 


R+(x) = max { R(x), 0}, R-(x) = max {— R(x), 0}, 


so that 
0< <|R(x)|, OF R(x) S| R(x)I, 
< R(x) |"dx, < R(x) |"dx. 


1 


Let p be any integer satisfying the inequalities 
n—-2<p<m 
and let A’, B’, C’ be defined by the formulas (2.2) with n, p, g replaced by 
n—2, p, m, respectively. Let us consider the expressions 


(6.4) T*(x) = A’ono(x; Rt) + (x; Rt) + C'on(x; Rt), 
(6.5) = R-) + (x; R-) + C'on(x; R-) 
whose difference is 


A'o,_2(x; R) + B’o, (x; R) + C'on(x; R) = R(x), 


and let us set 
= A’ x; Rt) - (x; R-) + x; Rt), 
= (x; R') — (x; Rt) + C'on(x; R-). 
The polynomials R, and R, are non-negative over the interval -1Sx3+1 
and their degrees do not exceed m. Clearly (6.1) is satisfied, since 
Ri(x) — Ro(x) = T*(x) — = R(x). 


Moreover, using Minkowski’s inequality, and taking into account (2.7), we get 
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+1 “a 
{f Rash f Tn—2(%; 
-1 
+1 on 
+| B’| {f (x; 
-1 
+1 
+ ot f Cu (23 
-1 
+1 
Kan} 


Kan = A’ +| +C’. 


with 


Similarly we prove (6.2) for ¢=2. 

A comparison of this argument with that of section 4 shows that the con- 
stant K,,,, of Theorem 2 is the same as the K,,,, of Theorem 1. This immedi- 
ately gives the inequality (6.2) (cf. (5.1)). 

7. It is clear that an argument similar to that of the preceding section 
(and even slightly simpler, because we may use the first arithmetic means in- 
stead of the second) gives an analogue of Theorem 2 for trigonometric poly- 
nomials 


(7.1) = (1/2)ao + (a, cos + b, sin v@). 


THEOREM 3. Let r2=1 and let n<m be two positive integers. Every trigono- 
metric polynomial (7.1) of order n may be represented in the form 


(7.2) = Ui(6) — U2(8), 


_ where U, and Uz are non-negative trigonometric polynomials of order at most m, 
satisfying the inequalities 


(7.3) { viae\ < 


with K,,, depending on m and n only. If m and n satisfy (1.1), then 
(7.4) K'(1 +5") 


where K' is an absolute constant. 


The proof may be condensed at some points. Given a function f(@) of 
period 27 we shall denote by s,(0; f) and o, (0; f) the partial sums and the 
first arithmetic means of the Fourier series of f. If <q are two non-negative 
integers, then the linear combination 


vel 
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written as a trigonometric polynomial, has its mth partial sum equal to 
s,(0; f). Similarly as in section 6 we introduce the functions U+(@) and U~-(@) 


and the polynomials 


m+1 


n 

= ——— U- (8; U*), 
U;(@) 1( U+) 
Ux(¢) = —— U*) + 


These polynomials are non-negative, of degree at most m, and satisfy (7.2). 
Taking into account that, for any function f(@), 


2r 2s l/r 
{ f | <{f | 40) lsrs+t+o, 
0 0 


we get (7.3) where 


This yields (7.4), with K’ =2, provided that m and m satisfy (1.1) 
§3 


8. In the proof of Theorem 1, instead of the second arithmetic means 
o,' (x; f) of the Fourier-Legendre series of the function f, we could have used 
the first arithmetic means a, (x; f) of that series. The argument would have 
been similar, for although the first arithmetic means K,! (x, #) of the series 
(2.3) are not positive, they have the property that the integral 


+1 

(8.1) f | Ka(a, | de 

is bounded in m and x(’), and that is the only property of the kernel that was 
required in the proof. The advantage of using the second arithmetic means is 
that the positiveness of the kernel K,/" (x, #) is more elementary than the 
boundedness of the integral (8.1). Moreover, for the proof of Theorem 2 the 
fact that we used a positive kernel was essential. On the other hand, using 
the kernel K,! (x, 4) we might obtain a better estimate for K,,, than the 
inequality (5.1). Namely, we might obtain that K,,,,=O(8-') for —-+0. 
That this estimate is still too crude is shown by the following: 


(7) See T. H. Gronwall, Uber die Laplacesche Reihe, Math. Ann. vol. 74 (1913) pp. 213-270. 


mt+n—1 2m + 1 
Ka. = < > 
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THEOREM 4. Suppose that 0<6531, and that A; is the least number satisfy- 
ing (1.10). Then 


(8.2) A; 
where M denotes an absolute constant. 


It may be observed that the order 5—'/? in (8.2) cannot be improved. For 
if, for a given m, we take the least possible value of m, namely m=n-+-1, then 
6=1/n. On the other hand, the polynomial P(x) is then determined uniquely 
by its values at the points (1.6). It is, however, well known that a polynomial 
P(x) of degree m, which is absolutely less than 1 at the points (1.6) 
where m=n-+1, may be of the order m'/? at some points of the interval 
—1sx35+1(®). Hence 5-’’ is the right order of the coefficient A+. 

In the proof of Theorem 3 it is slightly more preferable to use the kernel 
K,! (x, #) instead of K,!’ (x, #). However we shall not appeal to the fact that 
the integral (8.1) is bounded (on the contrary, our argument would easily 
prove that fact). 

We shall confine our attention to the inequality (1.10). A practically iden- 
tical argument gives the inequality (1.9), which also may be directly deduced 
from (1.10) by making r tend to + «. 

Let us revert to the proof of Theorem 1 (see section 4). We start with the 
inequalities (4.1) and (4.2), where f denotes a polynomial. In the formula 
(4.3) we replace T(x) by the polynomial 


n m+i 
(8.3) (a) = 6-48; f) + Om( x; f) 


m—-nt+i1 —n+1 
which is of degree at most m, and which has the property that 
7) = Sa(x; f). 


Instead of (4.5) we get 


(m) 


+1 m 


where, with the notation of section 4, 


-1 
Theorem 4 will have been established, therefore, when we have shown that 


+1 l/r’ 
(8.6) { f | r(x) < M's", 
-1 


where M’ denotes an absolute constant. 
(*) See, for example, G. Szegé, loc. cit. pp. 328-332. 
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9. We observe that 


+1 
(9.1) -f (x, 


where 


n m+ 1 
(9.2) A(x, t) K,-1(2, t) 
— 


K..(x, 
m—-n+1 n+1 


We shall find for the integral 


+1 
(9.3) f | H(x, t)| dt 
a majorant independent of x. For this purpose let us consider the first arith- 
metic means of the series 


(9.4) > (» + 1/2)P,(cos 7). 


v=0 


These means will be denoted by x,(y). Using the familiar formula 


(9.5) P,(cos 0) P,(cos = f P(cos y)d¢, 


0 
where cos y =cos @ cos #’+sin @ sin 0’ cos ¢, and setting 
x = cos 6, t = cos 6’, 


we find that 
+1 Qe 

(9.6) f | H(x, de -f h(-y)d@ | sin <ff | | dw 
-1 0 0 8 


where S denotes the unit sphere and dw is the element of area on S. It is 
clear that the last integral, which we shall denote by J, is independent of @, 
so that we may take for example @=0. Thus 6’ = and 


(9.7) I = f | | sin ydy. 
0 


Let us now revert to the formula (9.1). Hélder’s inequality gives 
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+1 l/r’ +1 l/r 
|r(x)| < {f | {f | A(x, | ash 


+1 Wr’ 
< f | f(t) H(x, t) | as\ 


+1 +1 +1 
'dx < d A(x, d 


+1 +1 
rs f | f | H(x, t) | dsI", 


on account of (9.6). This fact and (8.6) show that Theorem 4 will be a conse- 
quence of the inequality 


(9.8) I < M's? 
which we are now going to prove. 


10. We shall use the fact, first pointed out by Fejér(*), that the series 
(9.4) is a Cauchy product of the two series 


v=0 


Taking into account the familiar formula of Mehler (?°) 


2 i 1/2)é 
P,(cos y) = — f dt, 
x J, [2(cos y — cos ]*/? 


we see that the vth partial sum of the first series (10.1) is equal to 


V,, 


1 — cos (v + 1)é 
2sin (1/2)¢[2(cos y — cos #) 


say. Since the vth partial sum of the second series (10.1) is the Dirichlet 


kernel D,(y) =sin (v+1/2)y/2 sin (1/2), the first arithmetic means x,(v¥) 
of the series (9.4) are given by the formula 


N 
(N + 1)xy(v) = 


2sin 2 sin (t/2)[2(cos y — cos #) 


(*) Fejér, loc. cit. 
(#°) See, for example, G. Szegi, loc. cit. p. 87. 


BOSTON UNIVERSITY 
COLLEGE OF LIBERAL ARTS 
LIBRARY 


51 
so that 
| 


52 A. ZYGMUND - [July 


Let us now represent here the products of sines by cosines as differences 
of sines, and let us apply the formulas for the sums of sines of an arithmetic 
progression. The last formula ‘may then be written 


1 ay (y) + an + an 
7 


2 
(10.2) (N+ 1)xy(y) = 2 sin (¢/2)[2(cos y — cos #) }*/? 


2sin 
where 
mn ) 1 — cos (N + 1)y 
a = ’ 
2 sin y/2 
(2) 1 cos (WN + — cos (VN + 1)y 
av (7,4) = ——- 
2 2 sin (t — y)/2 
(3) 1 cos (WN + 1)# — cos (NW + 


ow) 2 2 sin (¢ + y)/2 


Let us now consider the formula (9.5) and let us set 
m—(n—1)=, 


so that 
(10. 3) hy) = X-*[(m + 1)xm — (m — d+ 1)xm-a]. 
Hence, if in (10.2) we substitute for N the values m and m—\, and subtract 


the results, we get 


h(y) = (y) + (y) + h(y), 
where 


(i) (i) 


(10.4) = dt 
2siny/2J, 2 sin (t/2)[2(cos y — cos 
(¢ = 1, 2, 3). 


We shall show that 


(10.5) f | (y) | sin ydy S C(m/d)"2, 


where C is an absolute constant. (In the rest of this paper we shall denote 
by C absolute constants, not always the same at every occurrence.) Since the 
integral I (cf. (9.7)) does not exceed the sum of the integrals (10.5) multi- 


plied by 2x, and since 
A>m—n2= ni = mi/?, 


so that m/A<2/6, the inequalities (10.5) will imply (9.8). 
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11. Let us take for example 1=2. We note that 


| | sin -y cos (y/2) 


1/2 (2) 


f (2 sin ¢/2) [2 sin (¢ — y)/2-2 sin (¢+ y)/2] — dt, 
7 

and we break up the integral on the right into three integrals extended re- 

spectively over the intervals (y, y+1/m), (y+1/m, y+1/d), (y+1/, 

Correspondingly 


(11.1) | | sin y S Ri(y) + Rely) + Ra(y), 


where 


R; = cos f Rz = cos (y/2) 
7 Y+1/m 
Rs = cos (y/2) 


Observing that 
sin u = (2/r)u (0S 72/2), 
‘sin (y + cos (y/2) sin (OS St 
we find that 


R:(y) S$ Ch" — y)-8/2d¢t = Cr. 
Again, 


y+1/r 


f 


y+i/r 


= 2-8/2, 


Thus we got the following two inequalities for R;: 
(11.2) Rx(y) S Or, S 


Taking into account the structure of the numerator of a, we find 
that 


(11.3) Rely) on f { | sin 4/2| +| sin — 

If we replace each sine on the right by 1, we get 


(¢*) If any of these intervals partly projects outside (0, x) we integrate over only the part 
contained in (0, x). 


53 


A. ZYGMUND ? [July 


Ry) Cr f — f (¢— 


Y+1/m y+1/m 
S Chm 24-972, 
If we replace the sines by the corresponding angles and observe that y Si, 


we get 


(11.4) Rv) = C — S Cm 2-1/2, 
Y+1/m 


Thus 
(11.5) R2(y) S S 


Finally, we observe that 


7 


and that the numerator of a2) is 


sin (1/2)(m + 1)(¢ — y) sin (1/2)(m + 1)(¢ + +), 


so that the numerator of a?’ —a®), is 


[sin (1/2)(m+1)(¢—-y) —sin (1/2)(m—A+1)(t—7)] sin (1/2)(m+1)(¢+y) 
+ [sin (1/2)(m+1)(t+-)—sin (1/2)(m—d+1)(t+7)] 
‘sin (1/2)(m—d+1)(¢—7) 
=2 sin (1/4)A(¢—-) cos (1/2)(m+1—2/2)(t—vy) 
-sin (1/2)(m+1)(t+~7) 
+2 sin (1/4)A(¢+-) cos (1/2)(m+1—2/2)(t+y) 
-sin (1/2)(m—d+1)(t—7). 


If we replace each sine on the right by the corresponding angle, the cosines 
by 1, and if we note that y+/3S 22, we find that 


Ri(y) Cm f — y) f (¢ — 
7 


(11.6) 


< Cm 
However, we may also note that the right-hand side of (11.6) does not ex- 


ceed 
C[A(é — y) + m(t — y)] S Cm(t 


so that 
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yt1/m 
Ri(y) Cm) f — Cm 
Thus 


The inequalities we have proved show that 
(11.8) f Ri(y)dy C(m/d)*? (k = 1, 2,°3). 
0 


For k=3, this follows immediately if we break up the integral (11.8) into 
two, extended over the intervals 0SyS1/A and 1/ASySr, and apply re- 
spectively the inequalities (11.2): 


1/d 
f Rsdy sf Crdy + C S C(m/d)*/2, 
0 0 

Similarly, using the inequalities (11.4) and (11.7), we settle the cases k=2, 
3 of (11.8). Taking into account the definition of the functions R;, we deduce 
from (11.8) the validity of (10.5) for ¢=2. 

The case 4 =3 is similar to that of 1=2, the only difference being that the 
factor 2 sin (¢—~)/2 in the denominator of a is replaced by 2 sin (¢+-)/2, 
which is larger, on account of the obvious inequality 


(¢— y)/2 S (¢+ y)/2 — (¢ — y)/2. 


Thus the integral (10.5) when 7=3 is less than the integral with 1=2. 
The case ¢=1 is also easier than that of 1=2 but is slightly different. We 
observe that 


cos (y/2) f sin ¢/2)-"[2(cos y — cos #) 


< c(f + 
7 y+1/r 


at | 


= 


Of the two terms in the last square brackets, the first prevails for y<1/\, 
the second for y21/A. Hence, if we denote the left-hand side of (11.9) by 
E(y), we have the following inequalities 


(11.10)  E(y) if by Ely) SOA”, if Ay 2 1. 


Obvicusly 
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| (y) | sin y SCA“"E(y) | cos (m+1)y7—cos (m—d+1)y | /2 sin y/2 
<Cd-E(7) | sin (Ay/2) sin (m+1—2/2)y| /2 sin y/2. 
The right-hand side here may only be increased if the product of sines in the 
denominator is replaced by one of the following numbers: \(m+1—d/2)y2/2, 
dy/2, 1. The first will be the most advantageous in the interval 0Sy S1/m, 


the second for 1/mSy31/\, the third for y21/A. Taking into account 
(11.10), we deduce from (11.11) that | h(y)| sin ¥ is majorized by the follow- 


ing expressions 
if OS yS1/m; if 1/m, 


(11.11) 


and these inequalities give (10.5) for i=1. This completes the proof of Theo- 


rem 4, 
12. It may be of some interest to point out that the constants Aj and Bj, 


of the inequalities (1.3) and (1.4) are O(log 1/5) as 6—>+0. For the inequality 
(1.3) this is proved in Bernstein’s paper cited above. The inequality for Bj 
follows almost immediately from the argument used in the paper by Marcin- 
kiewicz and Zygmund('), provided we use the fact that, if 


K,() = (m + 1)-* {sin® (m + 1)t/2}/2 sin? ¢/2 


denotes Fejér’s kernel, and 0<A<m, then 


f | (m + 1)Km—s(t) — (m — 1)Kma(é) | dt = O(1 + log m/). 


For the proof of this inequality it is sufficient to split up this integral into 
three, extended respectively over the intervals (0, 1/m), (1/m, 1/A), (1/A, 7), 
and to observe that in the first integral the integrand is O(m), in the second 
it is O(1/#), and in the third O(1/\#*). 


Mt. HoLyoxe COLLEGE, 
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FLAT SPACE CONGRUENCES OF ORDER ONE IN [n] 


BY 
EDWIN J. PURCELL 


By an [n—k]-congruence of order one in [n], k<m, is meant an alge- 
braic *-system of [n—k]’s in n-dimensional projective space such that one 
and only one [n—k] of the system passes through an arbitrary point of [rn]. 
A point on more than one [n—k] of the congruence is called a fundamental 
point. A fundamental point fails to determine any [n—k] of the congruence. 

Although some special cases of flat space congruences have been studied, 
there is no general theory in the literature("). Line congruences of order one 
in [3] were investigated by E. Kummer(?) and this was completed by 
R. Sturm(*). G. Marletta(**) classified line congruences of order one in [4] 
but his methods were not suitable for extension to [n]. Some line congru- 
ences in [5] were discussed by M. Sgroi(*). G. Aprile(”) collected the known 
examples of line congruences of order one in [n]; these are all special cases 
of the more general congruences defined in the present paper and it will be 
shown in §8 that his construction for a line congruence in [n] with irreducible 
locus of fundamental points is a restricted case of one of ours(®). 

Probably not all possible flat space congruences of order one are included 
in what follows. The line congruence in [4] consisting of the trisecants of 
the F*, projection of the Veronese surface in [5], and the two line congruences 
in [5] consisting of the bisecants of a ruled F* or of the bisecants of the del 
Pezzo surface, do not appear to belong to our class of congruences. 

1. Classification. On each [n—&] of our [n—k]-congruences of order one 


Presented to the Society, November 28, 1942; received by the editors April 6, 1942, and, 
in revised form, January 6, 1943. 

(*) Encyklopidie der Mathematischen Wissenschaften vol. III C 7, p. 964. 

(?) E. Kummer, Uber die algebraischen Strahlensysteme, in besondere die der ersten und 
sweiten Ordnung, Berlin, Abhandlungen der Preussischen Akademie der Wissenschaften, 1866. 

(*) R. Sturm, Die Gebilde ersten und sweiten Grades der Liniengeometrie in synthetischer 
Behandlung, Leipzig, vol. 2, 1893. 

(*) G. Marletta, Sus complessi di rette del primo ordine dello spazio a quattro dimensiont, 
Rend. Circ. Mat. Palermo vol. 28 (1909). 

(®) Marletta, Sopra i complessi di rette d’ ordine uno dell S,, Atti Accademia Gioenia Catania, 
(5) vol. 3 (1909). 

(*) M. Sgroi, Sus 4-complessi di rette d’ordine uno neli’S,, Atti Accademia Gioenia Catania 
(5) vol. 15 (1925). 

(7) G. Aprile, Giornale di Matematica vol. 70 (1932) pp. 196-216. 

(*) After the present paper had been sent to the editor, I received from Professor Virgil 
Snyder an offprint of his Cremona involutions belonging to the Bordiga surface in [4], Revista 
de Tucum4n (A) vol. 2 (1941) pp. 203-210, in which he describes a line congruence of order 
one in [x] consisting of the (n —1)-secants of the generalized Bordiga. This is type (n —1). in 
my classification. 
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in [n] there is a V%_,_,, all of whose points are fundamental. (Vi means 
points.) This variety may be irreducible or composite. In the latter case the 
dimension of each component is n—k—1 while their orders may be any posi- 
tive integers Ai, As, - - - , Aw, whose sum is k. 

The type symbol (A1A2A3 - - - Aw)n will be used to indicate an [n—k]- 
congruence of order one in [n] whose generic [n—k] has on it w distinct 
fundamental varieties of orders A1, Az, ---, Aw, respectively. When the 
meaning of the type symbol becomes comp etely evident (§4), it will be seen 
that permutation of the A; in the type symbol changes the type. Therefore 
there are 2*-' types of [n—k|-congruences of order one in [n], for which the fun- 
damental loci on a generic [n—k| are all distinct. 

Since some or all of the fundamental varieties on a generic [n—k] of the 
congruence may be coincident (§6), we divide [n—k]-congruences of order 
one in [m] into those for which these components are all distinct and those 
for which some or all are coincident. Until §6, congruences in the former 
category will occupy us. 

2. Fixed base. The equations 


in which the A; are parameters and the x;; are linear forms in the homogeneous 
projective point coordinates xo, x1, - - - , x, of [m], having arbitrary constant 
coefficients, represent an ©*-system of [n—r]'s in [mn]. 

A necessary and sufficient condition for an [n—r] of this system to pass 
through an arbitrary point P of [7] is 


(2.2) 
+ + r+1 = 


where ?,;; is the result of substituting the coordinates of P in x;;. 
The symbol 


411° ° * X1r41 


(2.3) 


means r+1 simultaneous equations X,=0 (h=1, 2, ---,r+1), where X; is 
(—1)**! times the determinant formed from the matrix in (2.3) by omitting 


the kth column. 
Denoting by P, the result of substituting ,; for x;; in X,, we have from 


(2.2) 


+ = 9, 
|_| 
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Therefore 
Pity + = 
(2.4) 


are the equations of the [n—r] of system (2.1) through point P. 
Now a necessary condition for any other point Q to lie on (2.4), the [n—r] 
through P, is 


(2.5) 
Pigri + + r41 0, 


where the q;; are obtained by substituting the coordinates of Q in the +;;. 
From which P,:P2: :P,y41=Q1:Qe: :Q,41. Thus every point on (2.4) 
determines that same [n—r], and the ratios P,:P2: - - - :P,4; are unique for 
the [n—r] of the system (2.1) passing through P. 

Since (2.4) defines one [n—r] through an arbitrary point P of [n] and 
every point of (2.4) determines that same [n—r], it follows that the totality 
of [n—1]'s given by (2.4) for all points P of [n] form an [n—r]-congruence of 


order one in [n]. 

(2.3) is a determinantal locus(*). Since it will appear that (2.3) serves as 
a carrier or base for flat space congruences of order one, (2.3) will be called a 
fixed base. 

Indicate the fixed base (2.3) by ¢4,. It is a variety of dimension nm —2 and 
order C,+1,2(2°). Degenerate cases, for which the order of the fixed base is 
lower, will be treated in a separate note. 

The points of -A; are fundamental points for the [n—r]-congruence of order 
one in [n] whose generic [n—r] is given by (2.4). 

For a general point P not on cA, at least one of the P; (j7=1,2, - - -, r+1) 
is different from zero. For convenience, let P,,:0. If all the other P; are 
zero, it is obvious that the intersection of the [n—r] through P, given by 
(2.4), with the fixed base (2.3), is the same as the intersection of (2.4) with 


“11 “ee 
(2.6) = 


If some other P;, say P;, is also different from zero, (2.6) can be rewritten 


(*) T. G. Room, The geometry of determinantal loci, Cambridge, 1938, p. 33. 
(}°) Room, loc. cit. pp. 34, 43. 


| 
Pigu + + = 0, 
Vir 
Xrr 
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P; 


Using (2.4), this becomes 


%1 i+1 °° * 


which is Xi/P; =0. 

Since the selection of P,,; as the nonzero P; was arbitrary, it follows that 
the intersection of the [n—r] through P, given by (2.4), with the fixed base (2.3), 
is the same as the intersection of (2.4) with any X;=0 for which P;+0. 

This intersection is a determinantal primal in the space [n —r] through P 
and is of order r. Its equation is formed by imposing the r linear conditions 
(2.4) on the running coordinates xo, x1, - - - , X, in each element x;; of X;=0. 

In the fixed base (2.3), let x;;=) ?_9@s;0x:, where a;;, are constants. Equa- 
tions (2.1) may be rewritten 


+ + AinXn 0, 
(2.7) 


+ + = O, 


where In general, for each set of values - , these 
equations are independent and represent an [n—r] of the congruence based 
on (2.3). A necessary and sufficient condition for equations (2.7) ‘to be de- 
pendent is that the rank of the matrix 


(2.8) 
Aro Xen 


be m (m<r). In such event, (2.8) represents a variety A of dimension(!") 
r—m (n+1—r+m) in a space [r] whose generic point has coordinates 
(Ax, - «+, Aezi). The order of A is given by a formula to Segre(!*). To each 
point of A corresponds a set of values Ax, - - - , Ar41 which determine in (2.1), 
not an [n—r], but an [n—m]. 

For example, the type (3), line congruence of order one in [4] consists 
of the trisecants of a Bordiga sextic surface. This surface is the fixed base -/,, 
all of whose points are fundamental. But there are 10 planes in [4], each of 
which cuts the Bordiga in a cubic curve. Through a general point of such a 


(#) Room, loc. cit. p. 36. 
(#2) Room, loc. cit. p. 42. 


|=0. 
Xrr 

P; 
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plane a pencil of lines can be drawn, each of which is a trisecant of the 
Bordiga surface. The equations of the 10 planes are obtained by substituting 
in (2.1) values of Xx, - - - , A-41, which make the rank of matrix (2.8) be m =2. 

Let #;,:,-. .«, =t() be the r-row minors of the matrix (2.8). These minors ¢,;) 
are forms in - - - , An [n—1] with coordinates belongs 
to the congruence if and only if its coordinates i.) satisfy all homogeneous alge- 
braic relations which are satisfied identically by the t.:); that is, if and only 
if F( +++, * )=0 (tdentically in dy, do, +++, implies 

3. Dependent base. When r=k in §2, there results an [n —k]-congruence 
of order one in [n] for which the fundamental variety on a generic [n—k] 
is irreducible. 

When r<k, other [n—&]-congruences of order one may be constructed as 
follows. For these, the fundamental variety on a generic [n —k] is composite. 

Consider 


(2) (2) 
411 * Xe 


(3.1) = 0, 
(2) (2) 
* 


where x1?’ =) t-oonee, in which a% are forms of order ox (any positive in- 


teger or zero) in the coordinates P, (h=1, 2, -- - , r+1) of (2.4). 

(3.1) will be called a dependent base. Since this concept seems to be new, 
it is emphasized that the x? are linear forms in the projective point coordi- 
nates of [n], having coefficients which are forms of order a2, in the P;, Ps, - - +, Press 
of the [n—1] through arbitrary P. 

Geometrically, an arbitrary point P of [n] determines the [n—r] of (2.4) 
through it whose P;, - - - , P,,; make (3.1) a fixed base with elements that 
are linear forms in the running coordinates %o, - - - , X,, with constant coeffi- 
cients determined by P;, - --, P,41. Thus there is defined in the [n—r] 
through P an [n—r—s]-congruence of order one whose [n—r—s] through P 
is given by equations (2.4) along with the equations 


(2) (2) (2) (2) 
Py ay + = 9, 


(3.2) 


(2) (2) (2) (2) 
Py + 041 = 0, 


in which P® is obtained by substituting the coordinates of P in X?, and X}? 
is (— 1) +! times the determinant formed from the matrix of (3.1) by omitting 
the jth column. 

The ratios P®: P2: - - - :P®, are characteristic of the [n—s] through P, 
given by (3.2). 
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If r+s=k, equations (2.4) and (3.2) together define a unique [n—k] 
through a generic point P of [], and the totality of these [n —k]’s constitute 
an [n—k]-congruence of order one in [n]. The fundamental variety on each 
[n—k] of the congruence is composite, consisting of a Vz_,%-1 in which the 
[n—k] intersects (2.3) and a V3_,-1 in which the [n—k] intersects (3.2). 

If r+s<k, another dependent base may be constructed whose elements 
are linear forms in Xo, %1, - - - , X, with coefficients that are homogeneous of 
order o3, in the P;, - - - , P,41 of (2.4) and also homogeneous of order a3 in 
the coordinates P?, - - - , P®, of (3.2) (os and og any positive integers or 
zero). 

This process may be continued until the number of equations defining the 
generic flat space of the congruence is equal to k. 

The effect of these dependent bases is to associate with each element of a 
flat space congruence of order one in [”], in the most general possible manner, 
a unique congruence within that element. 

4. Type (A: A2--- Aw)n. We are now prepared to describe completely 
the general type (A1A2- ~:~ Aw)n, where the A; are any positive integers 
whose sum is k<n. 

Associated with the positive integer A,=r in first position in the type 
symbol is the fixed base 


Xi 


(4.1) 


* Xe ti | 


in which the x;; are linear forms in xo, x1, - - - , x, having arbitrary constant 
coefficients. 
Then (§2), 
+ Pr = 0, 


(4.2) 


where the P; are obtained by substituting the coordinates of an arbitrary 
point P of [n] in X;, and X; is (—1) ‘+! times the determinant formed from 
the matrix of (4.1) by omitting the ith column, represent an [n—r] through P. 

Associated with the positive integer A,=s in hth position in the type 
symbol (4 =2, 3, - - - , w—1) is the dependent base 


(h) (hk) 
411 °° * 


(4.3) = 0, 
(h) (h) 


where xf) = >°?_,a{x,, in which af? are homogeneous of order om in the 
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P;,--- , Pr of the flat space (4.2) through P, based on the fixed base 
(4.1) associated with A. The ef are also homogeneous of order Gam in the 
P™, P&™,---+ of the flat space through P based on the dependent base 
associated with A», (m=2, 3,---,h—1). The om and oim are any positive 
integers or zero. 
Then 
Py + +++ + Portion = 0, 


(h) (A) (kh) (A) 


Pi 41 + = 0, 
where P? is the result obtained by substituting the coordinates of point P 
in X®, and X® is (—1)4+! times the determinant formed from the matrix of 
(4.3) by omitting the dth column, represents an [n—s] through P. 
When the type symbol contains more than one integer, there is associated 
with the integer A,.=#, in last position in the type symbol, the terminal base 


(w w 


(4.5) = 0, 
(w) (w) 


where x=) in which are homogeneous of order tj in 


P,, Ps, - - - , homogeneous of order in PY, - - - , homogeneous of 
order 7;3 in - - - , and so on (j=1, 2,---,¢+1). The are any 


positive integers or zero. 
A terminal base differs from a dependent base in that the order of the 
forms comprising the coefficients of the linear forms in any particular column 
of the matrix of (4.5) may differ from the order of those in any other column. 
This additional generality is not ordinarily possible in the dependent base 
associated with A, for h<w, since it would cause loss of homogeneity in 
the 
From (4.5) we have 
(w) (w) (w) (w) 
Py tu + = 9, 
(w) (w) (w) (w) 
Py ta + = 0, 
in which the P™ are obtained by substituting the coordinates of P in X%”, 
and X‘ is (—1)#+! times the determinant formed by omitting the jth column 
from the matrix of (4.5). 
Equations (4.6) represent an [n—#] through point P. 
Equations (4.2), (4.4) (for h=2, 3, - - -, w—1) and (4.6) together repre- 
sent a unique [n—k] through an arbitrary point P of [n]. The totality of 
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such [n—k]’s, for all points P of [n], form an [n—k]-congruence of order 
one in [n], whose type symbol is (A1A2 - - - Aw)n- 

5. Fundamental points. For the [n—k]-congruence of order one in [n] 
whose type symbol is (A1A2 - - - Aw)n, there are w varieties As, - - -,Aw, 
each of dimension »—2, all of whose points are fundamental. Through each 
point of any 4; there pass infinitely many [n—k]’s. 

A; is the fixed base (4.1). We have seen (§2) that c/, is a variety of di- 
mension —2 and order a; = C,41,2. 

The dependent base associated with A; of the type symbol is 


(2) (2) 
411 °° * 


(5.1) 
(2) (2) 
where x? are linear forms in xo, - - - , Xn, whose coefficients are forms of order 
on in » 
Denote by xi and FP the result of changing to %o, - - - , x, the coordi- 
nates of P wherever they appear in x?’ and F,”, respectively. Now 


(2) (2) 
°° * Xi 


(2) (2) 


is a variety of dimension »—2. Its order is the number of points in which 
it intersects a general plane of [mn]. Since (5.2) has no special position 
relative to the frame of reference, its order will be given by the number 
of points in which it intersects the particular plane 7, whose equations are 
x3=x,= --- =x,=0. Since each element x? of the matrix (5.2) is homo- 
geneous of degree on in Fi, - - - , F,41, every point of ¢/4; lies on (5.2). That 
is, (5.2) is a composite variety of dimension »—2, consisting of <4; and a 
residual which is 43. We seek the order of -43. The order of ¢/; is the number 
of points, not on e/;, in which the plane x intersects (5.2). 

Denote by and the result obtained by equating to zero 
Xs, ++, x, in x, F;, and F, respectively. Salmon('*) has pointed out 
that the locus 


(2) (2) 


(5.3) 
(2) (2) 


(#) G. Salmon, Modern higher algebra, 4th edition, Dublin, 1885, p. 287. 
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is the intersection of ° =0 and 2 =0 after rejecting those points of the 


locus 


(2) 2) 


= 

which do not lie on 6° =0 and ®% =0. (Notice that the rows of this latter 
matrix are the columns of (5.3) common to ®” and 42.) 

Let s=1 in (5.3). Then &® =0 is a plane curve of order ron+1. Since &? 
is homogeneous of degree in ®,, - - - , and --- = 
=0 is C,41,2 points, each of these points is o:-fold on the plane curve ®°’ =0. 
Similarly, 62 =0 is a plane curve of order rax,+ 1 on which each of the C,+1,2 
points, --- =9,,,=0, is on-fold. Therefore ©? =0 and 62) =0 
intersect in (r¢m+1)? points, among which the C,+:,2 points count C,41,20%. 
That is, when s =1, the number of points, not on ¢4;, in which (5.1) intersects 
is This is the order of when s = 1. 

Now consider the case s=2 in (5.3). &? =0 is a plane curve of order 
2(ro2+1). Since is homogeneous of degree in - - - , each of 
the C,412 points, - - - = %,,,=0, is 20:-fold on the plane curve 
6? =0. Similarly, 62 =0 is a plane curve of order 2(re2+1), on which each 
of the C,4:,2 points is 6° =0 and =0 intersect in 4(ron+1)? 
points in which the points count as 4¢,3C,4:2. That is, =0 and 
2) =0 intersect in 4(roa+1)* —403,C,41,2 points not on We have already 
seen (s=1) that 

|| = 0 
consists of (ro2+1)?—o3,C,41,2 points not on ¢/4;. Therefore, for s=2, (5.3) 
consists of 3{ (rom+1)?—C,41,203,} points not on ¢/;. This is the order of ez 
for s=2. 

By induction it can be shown that the order of ¢4:, for s any positive in- 
teger, is a2 = Coy1,2{ (ron + 1)? — 


The determination of the orders of 43, ei, - - - , A» becomes increasingly 
tedious and does not differ essentially from the above. 
When o,.+0 in the dependent base (4.3), the P®, P{, --- appear 


homogeneously in the coefficients of the linear forms x{)), and this causes <4, 


to intersect «4; in a variety of dimension n—3 (k<h). If, moreover, ox, 0 
(g<k), e/;, intersects 4, in a variety of dimension »—3. But P®, PY, - - - 
appear in the P”, P®, - - - , and therefore also in the coefficients of the linear 
forms and consequently intersects in a variety of dimension n—3. 
If cA, intersects A, in a variety of dimension n—3 and A, intersects A, in a 
variety of dimension n—3 (g<k<h), then A, intersects A, in a variety of di- 
mension n—3. The three varieties of dimension »—3, just mentioned, are 


7 
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distinct. Subject only to this restriction, we cause <4, to intersect -4, ina Va_s 
or not, according as we choose o,, different from, or equal to, zero in con- 
structing our flat space congruence of type (Ai - - - Aw)n. 

6. Coincident fundamental points. For convenience in the type symbol 
(A; ++ Any let - A unique [n—u] 
of the congruence (A1A2 - - - Ax_1), passes through P. This [n—z] intersects 
A; in D; (i<h). D; is a determinantal primal of space [n—x] and is of dimen- 
sion »—u—1 and order A; (§2). 

By means of the dependent base (4.3) there is determined in [n—u] a 
fixed base D, whose matrix contains s rows. 

When s=A; (i<h), D can clearly lie on e4;. When s=A;—1, there are 
two systems of fixed bases, whose matrices have s rows, on D,(!*) and thus D 
can lie on D,. 

Therefore, whenever in the type symbol an integer is equal to, or one less than, 
an integer further to the left, there exists an [n —k|-congruence of order one in [n]} 
with coincidences among the fundamental points. 

In the case of a line congruence having A,.=1 in its type symbol, the 
plane through P of the associated plane congruence (A1Az2 - Aw_1)» inter- 
sects Ae, - - - , Aw_s in. plane curves D;, Dz, - - - , Dy_1, respectively, 
and ¢4,, in a single point. This point can be made to lie on any of the plane 
curves, and therefore for the type symbol (A,A2 Au-11)n, withAi+A2t+ --- 
+Awst+1=n—1, there exist line congruences of order one in |[n] whose funda- 
mental points associated with Ay are coincident with those associated with any 
earlier Aj. 

7. Ordinary space. In the interest cof exposition, the foregoing will now 
be applied to the well known situation in three-dimensional space. 

There is only one plane congruence("*) of order one in [3]. Its type symbol 
is (1)3 and its fixed base is 


(7.1) = 0, 


where x1; (j=1, 2) are linear forms in xo, x1, X2, Xs, with arbitrary constant 


coefficients. 
(7.1) represents a fixed line in [3]. The unique plane through this line and 


a generic point P is 
(7.2) + = 0, 


where P; is the result of substituting the coordinates of P in x» and P, is 
the result of substituting the coordinates of P in —xy. The totality of planes 
(7.2) form a plane congruence of order one in [3]. 

The locus of fundamental points, «/4;, is the line (7.1). 

(4) Room, loc. cit. p. 106. 

(5) While a pencil of planes in [3] is not ordinarily called a congruence, it fulfills our defini- 
tion (k =1), and such usage makes for uniformity of language. 
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In [3] there are line congruences of order one with type symbols (2); and 
(11)3. Each gives rise to an additional line congruence having coincident fun- 
damental points. 

The fixed base for type (2)s is 

X21 X22 
where the x;; are linear forms in xo, x1, X2, x3, having arbitrary constant coeffi- 
cients. (7.3) is a twisted cubic curve whose unique bisecant through a generic 
point P of [3] is 
Pixu + + P3x13 = 0, 
Pix2 + + Psx23 = 0. 


The P; are obtained by substituting the coordinates of P in X; (7=1, 2, 3), 
and X; is (—1)#*! times the determinant formed from the matrix of (7.3) by 
omitting the jth column. 

The totality of rays (7.4) is a line congruence of order one in [3] with 
type symbol (2)3 and whose irreducible locus of fundamental points ¢/; is a 
twisted cubic curve (7.3). 

If all the planes x;;=0 (¢=1, 2; 7 =1, 2, 3) pass through the same point O, 
the twisted cubic (7.3) degenerates into three lines through O. The congruence 
now consists of the bundle of lines through O. The three lines (7.3) through O 
are not fundamental but are members of the congruence. The only funda- 
mental point is the vertex O. This oldest line congruence of order one in [3] 
is thus a special case of our type (2). 

_ In the line congruence of type (11)s, the fixed base is the line (7.1). The 
terminal base is given by 


(7.5) 


where x?) =)°3_,a@x,, and a) are forms of order ra (any positive integer 
or zero) in P;, P: of plane (7.2). For each plane (7.2), (7.5) represents a line 
in [3] intersecting that plane in a point O. 

The equation 


(7.4) 


|| = 0, 


(7.6) Pym + Pi x2 = 0, 

in which P® and P®) are obtained by substituting the coordinates of P for 
the running coordinates in x and —x®, respectively, represents for each 
P,: P; a plane through P. 

Equations (7.2) and (7.6) together represent the ray through P, lying in 
plane (7.2) and through point O. The totality of rays given by (7.2) and (7.6) 
form a line congruence of order one in [3]. 

The locus of fundamental points is composite, consisting of -4;, the fixed 
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line (7.1), and e42, a space curve of order 71+72+1. The equations of <4; 
are formed from (7.5) by replacing the coordinates of P in the a% by the 
running coordinates Xo, x1, X2, x3. These equations contain the line (7.1) 
counted TT: times. The space curve has 711+72 intersections with line 4}. 
Point O is the intersection of curve -4; with plane (7.2), not on 4. 

If, in the preceding type (11); congruence, we impose the restriction that 
the determinant of the coefficients of xu, x», x®, and x2 shall vanish for all 
values of P;:P:, the two fundamental points on a generic ray of the congru- 
ence will coincide on the line (7.1). The locus of fundamental points is the 
line (7.1), provided that not both ry, and 72; are zero. There is established a 
(t11+72, 1) correspondence between the planes and points on (7.1). A generic 
point P of [3] determines a plane through line (7.1) and this plane in turn 
determines one point O on (7.1). Line PO is the unique ray of the congruence 
through P. 

It may be of interest to observe that while it has long been recognized that 
this latter congruence is a limiting case of the line congruence of type (11)s, 
it has hitherto been thought impossible to obtain the coincident case by spe- 
cialization of the parameters in the equations of type (11)s. 

8. Aprile’s congruence. The [n—2]’s determined by the point groups 
of a g*~} belonging to a rational normal curve of order 2n—3 in [2n —3] form 
an [n—2j-congruence of order one in [2n—3]. The intersections of the 
[n—2]’s of this congruénce with an [n] of [2n—3] are lines of a line con- 
gruence of order one in [n]. 

This construction is due to Aprile(*). We shall show that it is a restricted 
case of our type (m—1), congruence. 

Let the parametric equations of the normal rational curve C in [2n—3] 


be (¢=0, 1, 2, -- , 2n—3). 
Consider the fixed base 
Xo Xn-1 
%3°°* 
(8.1) = 0, 
Xn—2 Xn-1 Xn Xen-3 
whose elements xo, x1, , are point coordinates in [2n—3]. Then . 


Pixo + Pox. + = 0, 
Px + Pox +--+ + = 0, 


+ + +++ + = 0, 


(8.2) 


(*) Aprile, loc. cit. p. 204. 
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are the equations of a unique [” —2] through a generic point P of [2n—3]. 
(8.2) intersects the normal curve C in the m—1 points whose parameters 
are given by 


Py! + + + + Paw = 0, 
as may be seen by substituting the parametric equations of curve C in (8.2). 

Thus Aprile’s [n—2]-congruence in [2n—3] is the same as that given by 
(8.2), and the section of this congruence by an [n] of [2n—3] is Aprile’s line 
congruence of order one in [n]. 

The [n]-section of (8.1) is a fixed base. But (8.1) is what Room calls a 
partly symmetric determinantal locus, as is also its [m ]-section. Hence(!”), the 
freedom of the fixed base of Aprile’s line congruence of order one in [n] is 

6 = (1/2)(m — 1)(m — 2)(n + 1) — (wn — 1)n +1 


less than that of our fixed base, for n> 3. 


(?7) Room, loc. cit. p. 160. 
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ON THE APPROXIMATION TO INTEGRABLE 
FUNCTIONS BY INTEGRAL FUNCTIONS 


BY 
H. KOBER 


1. Introduction. The problem of uniform approximation to a function 
f(z) on some finite point set D of the z-plane (s=x+y) (a) by polynomials 
and (b) by rational functions is classical. The theory has been summarized 
by J. L. Walsh(*), who himself has proved a great number of important 
theorems some of which are fundamental. 

Let D be a point set containing the point z= ©. In that case only problem 
(b) has a meaning; there is, however, another generalization of problem (a), 
the approximation to an arbitrary function, defined in D, by integral func- 
tions. Not much attention appears to have been paid to that problem up to 
now, except for the work of S. Bernstein(?). 

Among the point sets D extending to infinity the following are the sim- 
plest ones and appear to be of particular interest: 

I. The real axis — <x< y=0. 

II. The positive part of the real axis O0<x< ©, y=0. 

III. The half-plane — <x< 

IV. The strip — <x< ~,a<y<b. 

V. The exterior of a circle. 

In case V, however, the uniform approximation by integral functions is 
not a reasonable problem, nor for any other domain of which z= is an 
interior point. In that case there is a circle C, with radius R and center at 
the origin, such that the boundary of D lies interior to C and C interior to D. 
Let {f,(2)} (m=1, 2, - - +) be a sequence of integral functions approximat- 
ing to f(z), then | f(z) Se, for | z| 2=R, where ¢,—0 as n—«. Hence 
| fa(z) —fi(z)| Sa+e,, uniformly for | z| 2R, and, by the maximum-modulus 
theorem, for | z| <R. By the Liouville theorem, the function f,(z) —fi(z) re- 
duces to a constant, ¢,, say, therefore f(z) is an integral function, represented 
in the form f(z) =f:(z) +c+/,(z) =c—c,, where c=lim (n— 

Presented to the Society, April 24, 1943; received by the editors July 27, 1942. 

(*) Interpolation and approximation by rational functions in the complex domain, Amer. 
Math. Soc. Colloquium Publications, vol. 20, New York, 1935. 

(*) Lecgons sur les propriétés extrémales des fonctions analytiques d'une variable réelle, Paris, 
1926. Bernstein gives some results concerning problem I; see Theorem 3(d), §6, of the present 
paper. See, furthermore, the Bernstein Theorems 12 and 13 (pp. 105-107); they deal with se- 
quences {f,(z)} of integral functions of exponential type, converging uniformly on the real 
axis to a function f(z) of the same kind, or to a function f(z) possessing some derivatives. A 
problem connected with the latter one has been treated by M. Krein, C. R. (Doklady) Acad. 
Sci. URSS. vol. 18 (1938) pp. 619-623. 


70 


. 


APPROXIMATION BY INTEGRAL FUNCTIONS 71 


The cases I and II will be treated in the present paper, the cases III and 
IV in a further paper. We shall deal with approximation both in the sense of 
convergence in mean with index p (0<p< _@) and in the sense of uniform 
convergence (p= ©). The order of the approximating integral functions will 
be required to be finite and to be as small as possible. Whenever in an infi- 
nite(*) interval a given function is approximated by integral functions of 
order p then it can also be approximated by integral functions of any order 
p’>p (§5). The “best possible” value of p is unity in problem I, while it is 
one-half in problem II (§§6—8). Therefore the integral functions of finite types 
of the orders p=1 and p=} play a dominant part in the results. Problem II 
presents the greatest difficulties. It will, in substance, be reduced to prob- 
lem I, for 1<p<@, however, by means of the approximation by rational 
functions (§7). The problem of “best approximation” will be treated in §9, 
and a generalization of problem I in §10. A number of auxiliary results will be 
needed, some of which are possibly of interest in themselves. They will be 
proved first (§§2—4), in order to simplify the presentation of the main results. 

I am greatly indebted to W. H. J. Fuchs, Aberdeen, in particular for sug- 
gesting to me the use of the Mittag-Leffler functions. 

By |f()|» we denote 


or ess. u.b. | | 


for 0<p< © or respectively; by we denote 


( J or ess. u.b. | f(i)|. 


By Gf (0<8<@) we denote the set of integral functions F(z) such that 
| F(z)| <A e8tolele (4,=A(e€)) for any €>0, by G® the set of integral func- 
tions of order not greater than p. We write Gs for Gj’. Evidently GY >G® 
when 0<p<ao. 

2. Generalization of a Hardy theorem and of a Plancherel-Pélya theorem. 


1. Let0<p;< ~, and let f(z) be analytic for | SR(j=1,2,---,n). 
Then, for 0<rsR, 


2r 
(2.1) W(r) = Dd | | ride 
0 j=1 
is a monotone increasing function of r, and log W(r) is a convex function of log r. 
For n=1 the result is due to G. H. Hardy(*). The general case can be de- 


(?) In consequence of the Weierstrass theorem, in a finite interval a continuous function 
can be approximated uniformly by integral functions of any order. 
(4) Proc. London Math. Soc. (2) vol. 14 (1915) pp. 269-277. 
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duced by an argument due to G. Pélya and G. Szegé(°). 


LemMA 2. Let F(z)€G, and F(t) be bounded in (—~, ~). Then, for 
— 0 <y< oo, | F(x+iy)| 


LEMMA 2’. Let P(z) be a polynomial, let F(z)\GGa, let 0<p<@ and 
P(t)| F(t)|"ELi(— ©, ©). Then 


(a) P(t) | F(t) |? +0 as t> +, 


| + iy) || + iy) | | PCO 
For 0<y< «©, Lemma 2 is proved by applying a Phragmén-Lindeléf theo- 
rem(®) to the function e+®*F(z) (€>0) first in both quadrants of the upper 
half-plane, and then in the upper half-plane, and finally by letting e—0. For 
P(z)=1, Lemma 2’ has been proved by M. Plancherel and G. Pélya(’); the 
general case can be deduced in a similar way, starting from the inequality 


1 
| P(zo) || (20) |» — f | P(zo + re**) | | + re**) | 
0 


where ®(z) is regular for | z—z0| <r. This inequality is a simple application 
of Lemma 1. 

3. On the spaces G,,, and Let p and a be fixed (0<ps ,0<a< ~), 
By Gz,» or G’/) we denote the set of functions f(z), belonging to G, or Gi”? 
and such that f(#)EL, (— ©, ©) or L,(0, ©), respectively. 


Lemma 3(8). Let fa(2)€Gap (n=1, 2,---) and | fult)—fa(t)| as 
m>n— «©. Then there is an element f(z) of Ga,» such that | 10) —fu(t)| »—0 as 


2, 


Lemma 3’. The preceding result holds when Ga,» is replaced by G°?), and 
| fu(t) —fn(t)| » by || fu(t) >» » by 


The lemmas do not hold whenever the order p of the integral functions is 
greater than 1 or 3, respectively(°). 


(*) Aufgaben und Lehrsdtze aus der Analysis, Berlin, 1925, vol. I, pp. 143, 144, nos. 302, 310. 
We start from the lemma: Let (j=1,2, - ++ ,m;k=1,2,-+-+,m) be regular and fi.x(2) | 
be one-valued in the closed domain D, let 0 <p; < ©. Then the function | | fen(s)| Py 

| fmx(2)| attains its maximum value on the boundary of D. 

(*) E.g. E. C. Titchmarsh, The theory of functions, Oxford, 1932, p. 177. 

(7) Comment. Math. Helv. vol. 10 (1937-1938), §§27-30. 

(*) For p= ©, the result is equivalent to the Bernstein Theorem 13. 

(*) Let p>1 or p>4, respectively, be fixed, let f(#) be continuous for —1<#S1 and 
f(t) =0 for |¢| 21. Then f(¢) is not equivalent to an integral function in (— ©, ©) nor in (0, ©). 
On the other hand, by Theorem 3(b), §6, and by the argument of §5, there are integral functions 
g»(z) of the type unity of the order p, approximating to f(#) in {(— ©, ©) or (0, ©), respectively, 
as n— ©; therefore | gm (t) —gn(t)| p—0 or || gm (t) —gn(t)|| p70. For and how- 
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Let f(z) €Ga,». In consequence of the hypothesis, there is a constant A, 
independent of m, such that | falt)| »<A, and a function ®(#)EL,(— ~) 
such that | $(t) —f,(2) | 20 as n— ~. Take first p= ©. By Lemma 2, we have 
|fa(2)| 


(3.1) | fn(z) — | — Salt) 


Consequently the f,(z) converge to an analytic function f(z), uniformly in 
_any finite domain of the z-plane. Therefore, for any value of z, we have 
| f(2)| sAe*'"l, and so f(z)€G,. Since, uniformly in any finite interval 
(—M, M), as we have in (—M, M) and, 
therefore, in (— ©, ~). 
Let now 0<p< _~, and put f,,.=fm—fs. By an argument due to Plan- 
cherel and Pélya(**), we have 


1 1 1 : 3 
| fm.n( 2) |» | (% + t + ir) | dt 


1 1 
<— ar f | + ir) 


Using Lemma 2’(b), with P(#)=1, we have 
(3.2) | fm n(x)! S (2/x)"Pe*| fan(t) |» 


By the argument used in the first part of the proof we deduce the existence of 
a function f(z) belonging to G, and such that, in any finite interval (— _M, M), 


M 
f | #(2) — fa(t) 0 as ©, 
Hence f(#) = in M) and, therefore, in (— ©, 

The case f,(2)€G°” is more difficult. Taking F,(z)=f,(z"), is an 
even function and belongs to G,. By this transformation, the result for p= 
is readily deduced from the corresponding result on G.. Let now 0<p<o, 
and F,(z) =f,(z*). Then we have 


(3.3) > = | |p < A (m = 1,2,---). 
3.4 m(t) — f,(t) = — F,(#) |? dt. 


We have to prove the existence of an even function F(z)€G, such that 
| { F(t) — } |¢| as In consequence of the hypothesis, there 


ever, we can prove: Let fals)EGE ) let y=2x—xp-, let C consist of two straight lines starting 
at the origin and making the angle y between them, and let, uniformly for z on C, fm(z) —fa(z)—0 
as m>n— ©. Then there is an element f(s) of G®” such that, unsformly on C, f,(s)—f(z) as n— ©. 
(°) Loc. cit., §30(73). The proof is based on the Hardy theorem. 


2 
‘ 


74 H. KOBER [July 
is an even function ©®(#) such that | t| ©) and that 
| p(t)| ¢| as n> Using the Plancherel-Pélya argument 


and Lemma 2’(b), we have 


1 1 
| -1 


(3.5) 
Similarly 
(3.6) | s| | — F,(z) |» < 


Hence, if & is an integer greater than p—' and M any positive number, we have 
2* F,,(2) —2*F,(z) 0 as m>n— ©, uniformly for -MsxsM, —MsysM. 
Consequently z*F,(z) tends to an analytic function V(z) as n— ©, uniformly 
in any finite domain of the z-plane. By (3.5) and (3.3), there exists a con- 
stant A’, independent of m, such that ||| F,(z)|*<A’ for |z| <1. By the 
maximum-modulus theorem, we have | F,(z)|?<A’ for |z| <1. Therefore, 
for | z| <1, we have |W(z)| <A"|z|* which shows that F(z) =z“*¥(z) is an 
integral function. Since 2*F,(z)—¥(z), from (3.5) and (3.3) we deduce that 
W(z) and, therefore, F(z) belong to G,. We are left to show that F(t) = ®(#) 
in (— «©, «). Let M be any positive number, and E be the set consisting of 
the two intervals (—_M, —M-') and (A/-', M). By the uniform convergence 
of the sequence {z*F,(z)}, we have 


Fa ae = f | W(t) — || 


as n—«. Hence F(t)=(#) in E and, therefore, in (— ©, ©), and F(#) is an 
even function; so is, therefore, F(z). 

We remark that, in the terminology of S. Banach, Gz,» and G°?) are (B) 
spaces for 1S pS ~("'). For 0<p<1, they are (F) spaces if the distance of 
an element f(z) from the null-element is defined by | f(é|2 or ||f(é)||2, respec- 
tively. 

4. Further basic lemmas. 

LemMA 4. Let x(z)GG, Let 1SpSe, p'+q'21 and 
SODEL,(—@, ~), KHEL(—-, ~). Let 


(4.1) g.(z) = af «lac — z) } f(t)dt (s = x + iy). 


Then g.(2) belongs to Ga. 


Proof. Let p’=p/(p—1). Then we have p’2q and, by Lemma 2’(a), 
x(t)EL,(— ©, ©). By Lemma 2’(b), we have 


(2) With “norm” » or S. Banach, Théorie des opérations linéaires, Warsaw, 
1932, Chapters III and IV. 
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(4.2) | «lat — a(x + |» =| — |p | |p. 
Therefore, by Hélder’s theorem, from (4.1) we deduce that 

(4.3) | ga(2) | | f() |p < 

Let C be the circle =2Re‘* S272), and let | s| SR. Then 


Cc 


= af «fal — } f(t)dt = g.(z), 


(4.4) 


the interchange of integrations being justified by absolute convergence. Hence 
ga(Z) is an analytic function for | z| SR. Since R is arbitrary, g.(z) is an in- 
tegral function, which completes the proof. 


Lema 5. Let k be a positive integer, let x(z) GG, and t*x(t)ELi(— ~). 
Let f(t) be measurable and bounded in any finite interval and f(t) =O(|t|*) 
(t+ ©). Then the function g,(z), defined by (4.1), belongs to Ga. 


Obviously x«(#)EL;(— ©). We have |f(t)| <A’(1+]2]* 
+ | t—s| *) for — © <t< o. Therefore, taking xo(z) =2*x(z), we deduce that 
Sorta’ | a 
4.5 
f | — z)} | de, 
| ga(z)| SAMA +| 


using (4.2), with p’=1. It is easy to show that (4.4) holds which completes 
the proof. 


6. Let ~, let R—1 be a positive integer and p=qk-"', let (x), 
¥(x)EL,(— and M=max {|¢(x)|q, |W(x)|_}. Then 


| — 6(z)— (A = max (1, — 1)/9)). 
Proof. Applying Hélder’s inequality, we have 


Pp 
= 
j=0 
> 
j=0 q/(k—1) 


which gives the required result. Obviously it holds in (0, ~). 
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5. Preliminary approximation theorems. 


THEOREM 1. Let 0<p3S~, let f(t) belong to L,(— ~, ~) or L,(0, ~), let 
p<1 or p<}, respectively. Let f(t) be approximated by elements g,(z) of G® 
(n=1, 2, ---), that is, 


(5.1) | f(t) — gn(t)|> 20 or |If(t) — as 


for p<1 or p<}, respectively. Then f(t) reduces to a constant which is zero 
for 0<p<o@. 


I. The interval (— ©, «). Here the theorem is a corollary of a result due 
to V. G. Iyer(!*). It can also be deduced from the Lemmas 2’(a) and 2 by 
means of the Liouville theorem. 

Il. The interval (0, ©). Since p<}, we have g,(z)€Go”’. Hence the 
even function F(z) =g,(z*) belongs to Go. Since g,(#)EL,(0, ©), we have 
| | UP? F,(t)EL,(— ©, ©) and, therefore, F,(#)€L,(— ©, ©). We can again 
employ the Lemmas 2’(a) and 2, which proves the theorem. 


THEOREM 2. Let 0<pS @ and let f(t) belong to L,(—~, ~) or L,(0, ~), 
respectively, let f(t) be approximated by elements g,(z) of G® (m=1, 2,---), 
and let o be any number greater than p. Then f(t) can be approximated by in- 
tegral functions of order a. 


I. The interval (—«, ~). By Theorem 1, we may suppose that p21, 
o>1. We need only find a single integral function W(z) of order ¢ such that 
©) (0<pS For (5.1) implies that | f(#) —h,(t)|,—0 as 
n—«, where h,(z)=g,(z)+n-'W(z), and the integral function 4,(z) is of 
order 

Let 0<A<1. The Mittag-Leffler function 


(5.2) Ais) = = 


is well known to be an integral function of the type unity of the order \—'. 
Let 0<e<1-—X. Then, uniformly in the angle (r/2)(A+ 6) Sarg 
—(r/2)(A+€), we have 
(5.3) E,(z) = O(| z|-*) (|s|—> @). 
When & is any integer greater than p~! then the function W(z) =(Ey,-(iz))* 
has the required property. 

II. The interval (0, ©). It will suffice to take ¢>1/2. The integral func- 


tion W(z) = (Z1;.(—2))* is an integral function of order ¢, and W(t)EL,(0, @), 
which proves the theorem. 


(*) Quart. J. Math. Oxford Ser. vol. 7 (1936) pp. 294-299, Theorems 2 and 4. 


‘ 
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6. Approximation functions of lowest order. Let O0<ps~ and 
SOEL,(— ©, @), and let f(t) not reduce to a constant. By Theorems 1 and 2 
there exists a lower bound of the p’s, provided that f(#) can be approximated 
at all by integral functions of finite order. This minimum is in fact attained. 


THEOREM 3(a). Let 0<p< _@ and f(t) belong to L,(—~, ~) or L,(0, ~), 
respectively. Then there are functions g.(2)€G, or €G!” (0<a—~@) such that 


| — galt)|p—>0 or — as ao. 


THEOREM 3(b). Let f(t) belong to L.(—~, ~©) or L.(0, ©), respectively. 
A necessary and sufficient condition for the existence of functions g.(z)CG, or 
EG!” (0<a—~) such that 


| — ga(#) —0 oF ga(t)||..—> 0 as 
ts that f(t) or f(t"), respectively, is uniformly continuous in (— ©, ~). 


The following theorems concern classes of functions f(#) which need not 
belong to any L,(— ~, ~) (0<pS 


THEOREM 3(c). Let, for some r (0<r31), and uniformly for — © <t;< @ 
(j=1, 2), f(d) satisfy the Lipschitz condition 


(6.1) | f(ts) — f(t) | S Al — 


Then f(t) can be approximated by functions g.(z)GGa (0<a—~), uniformly 
in(— ©, ), 


THEOREM 3(d). (Bernstein’s theorem)("). The preceding result holds if f(t) 
is continuous and f(t)—fi(t)—0 as and f(t) —f2(t) 90 as t->— where 
and f2(t) are linear functions. 


7. Proofs of Theorems 3. 
I. The interval (— ©, ©). We start from the following result("). 


Lemma 7. Let f(t)EL,(— ©, (1S p< @), or let f(t) belong toL..(— ~) 
and be uniformly continuous in (— ©, ©); let and 
=1, and let g(x) be defined by (4.1), replacing z by x. Then, as a-~, 
—ga(t)| » (1S pS ~). 


Let x(z) satisfy the conditions both of Lemma 4 and of Lemma 7. Take, 
for instance, the Féjer kernel «(z) =(2z*)~'(1—cos z). Then Theorem 3(a), 
for 1S p< , and the sufficiency of the conditions in Theorem 3(b) is an im- 
mediate consequence of the lemma. The necessity of the condition in 3(b) is 
readily deduced from the Bernstein("*) theorem: 

(8) Loc. cit., 106, footnote. It can be deduced from Theorems 3(b) and (c). 


(**) E. Hille, Trans. Amer. Math. Soc. vol. 39 (1936) pp. 131-153 §1. 
(5) Loc. cit., 102, Corollary 7. 
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Lemna 8. Let g(z) EG, and | g(2)| SA in Then | g’(2)| SaA in 
(— ©, «), therefore g(t) is uniformly continuous in (— ©, ~). 


Let now 0<p<1 and f(t)EL,(— ©, ~). Let & be an integer not smaller 
than p-! and kp=gq. Defining {f(#)}"/* in a suitable way, we have {f(#)}1/* 
EL,(—«, «~). Let x(z) be defined as above, and let 


(7.1) galt) = (< f = 


where B=ak—. Then | —ga(t)|»—0 as Obviously ys(z) and 
es=| as B> By Lemma 6, we have 


| f(t) — {va }*#|, Al 70 as Bom, 


1/k 1/k 
max(| |e |r|) ta <A’. 
Hence | f(t) —ga(t)| 0 asa, and obviously ga(z) EGa. 
To prove Theorem 3(c), we take, for instance, 


(7.2) x(z) = 


Since «(z) and, by (6.1), f(é), with k=1, satisfy the hypotheses of Lemma 5, 
the function g.(z), defined by (4.1), belongs to G.. We have 


f(2) gals) = — fle + 2) 


Hence, by (6.1), 
| f(x) | < Aa | x(at) | |¢ = Ao f | | dt, 


which tends to zero as a, uniformly for — © <x < «. This completes the . 
proof of the assertions concerning the interval (— ©, ). 

As an application of Theorem 3(c) we obtain the following result: 

Let 0<r<1, and let z* be uniquely defined by —7/2 Sarg 2 $37/2. Then 
can be approximated by functions (0<a—©), uniformly in 

The result follows immediately from the inequality | #—4| $2'-"|4—4|". 
It holds for 0<r<, and the corresponding result holds for the interval 
(0, ©), with g.(s)€Go”; this will be shown in the paper indicated above. 
We remark that, for r~1, 2, 3, - - - , # cannot be approximated by rational 
functions in (— ©, ) or (0, ~), 

II. The interval (0, ~). This case is, for p= ©, readily deduced from 
the corresponding result for the interval (— ©, ©). We have to take into 


since 
gf 
. 4 96 
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consideration that F(#) =f(é) is an even function and, therefore, 


1 1 — cos a(t — z) 
(7.3) Ya(z) = — F(t)dt 


wa J» (¢ — z)? 


is an even function as well. Let now p=1 and again F(t)=f(é). Then 
tF(t)ELi(— ©, ©). Replacing F(t) by ¢F(é) in (7.1), we see that y.(z) is an 
odd function, and that | tF(t) 10 asa— Taking g.(z) 
we have g.(z)€G°” and ||f(¢) —g.(#)||: 0 as a». 

For the further investigation, we need 

LEMMA 9(°). The sequence (n=0, +1, +2,---) 
is a complete orthonormal system with respect to L,(—~, ©) for1<p<o. 

Consequently(**), for 1<p< ©, any element f(#)EZ,(0, ©), can certainly 
be approximated, in the mean of order ~, by finite linear forms of the 
(«—t)"(¢+2)-*—. Therefore we need only show, that, given e>0 and any in- 
teger m, there is an element g.(z)€G!” such that || 
<e, or that there exists an even function y.(z)€G, such that 


(i 
(7.4) (i+ 
Let x(z) be defined by (7.2), and let 


Pp 
— ya(t)| | t| dt < e. 


; 
Then, for a2A=A(e), we have 
(7.5) — galt)| <e. 
The function g,(z) is even, and we have 
(i — #) "at Hi — 


= - x)} taf «fac x)} 


= $a(x) + Va(x), 
where Xo(z)=zx(z). Obviously | —wy.(t) | as an, 
while, by a well known convexity theorem(!’), 


(*) H. Kober, a forthcoming paper in Quart. J. Math. Oxford Ser. For the case p=2, 
see also E. Hille, Compositio Math. vol. 6 (1938) p. 99. 

(*") E.g., Inequalities, G.H. Hardy, J. E. Littlewood and G. Pélya, Cambridge, 1934, Theo- 
rem 204. 


(i + n+1 (i + 
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Hence |ba(x)| »—0 as a—; consequently, for some a>A, we have 
(7.6) | — + — tg. (t) |, <. 


By Hélder’s inequality, from (7.5) and (7.6) we deduce (7.4). For, taking 
x(é) = —g.(t), we have 


Thus we have proved the theorem for 1S p< The case 0<p<1 is treated 
as the corresponding case in (— ©, ©), 

8. Uniqueness. By the argument used in Theorem 2, it is evident that 
the sets of approximating functions, occurring in Theorems 3(a)-(c), are not 
at all unique. We can construct approximating functions of order p=1 or 
p=, respectively, with types tending to infinity, even when f(t) reduces to a 
null-function. Certainly for f(t)EL,(—©, ©) or E€L,(0, ©), however, the 
types tend necessarily to infinity provided that f(t) is not equivalent toa 
constant or to an integral function of finite type of the order p=1 or p=}, 
respectively. We can prove 


THEOREM 4. Let 0<pS@ and f(t)EL,(—-, or EL,(0, ©), respec- 
tively, let f(t) be approximated by functions g.(z)€GGa or EG”). Jet Ba. be the 
actual('*) type of ga(z), and let c=lim inf 8, (a—@) be finite. Then there is 
an element F(2)€G. or respectively, such that f(t)= F(t) in 
or in (0, ). 

By the hypothesis, there is a sequence {a,} (n=1, 2,---) such that 
and that the sequence {8,} of the actual types of the gq, (2) 
tends to c. Given we have 8, Sc+e for n2A =A(e), or 
Gi), respectively, and 

| San (t) — 0 or || ga, (4) = £a,(2)|| 0 
as m>n—. By Lemma 3, the g.,(z) converge to a function F(z) belonging 
to or respectively. We can take e—0. Evidently F(t)=f(t) which 


proves the theorem. 
9. The problem of best approximation. 


THEOREM 5. Let 0<pS@ and f(t)EL,(— ~) or EL,(0, respec- 
tively, and let a be fixed. There exists a function g(2)€G. or €G°” of best 
approximation to f(t), satisfying the inequality | fd —8()|» s | fo- —h(t)| » or 
If) for any h(s)EG. or respectively. For 
1<p< , g,(z) is certainly unique. 


Let f(t)EL,(— ©, ©), and let ho(z) be an arbitrary element of G,,, (§3). 


Ba =lim Max} (log | ga(s)| /r). 


= 
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Denoting by E the set of elements h(z) of G. such that |f(t)—h(t)|,SAo 
= | F(t) —ho(t) | p, by the argument of §3 we deduce that, unifurmly for h(z) EE, 
| (2) | <Az,e*!¥l, By a well known argument(!*) we can now deduce the re- 
quired result, using the Lemmas 3, 3’ and, in the proof of the last assertion 
of the theorem, Minkowski’s inequality. The interval (0, ©) can be dealt with 
in a similar way. 


THEOREM 5’. Let f(t)EL2(— ©, ©). Then the sequence of best approxima- 
tion is {g.(2)} ={D.f} (0<a—~@~), where 


Daf = Dalf; 2] = f 


ts the Dirichlet singular integral. 


By Lemma 4, we have D.f€G.. Let F(u) be the Fourier transform of f(¢). 
It is well known(?°) that 


x] = fF 


Hence ®(u), the Fourier transform of f(x) —D,f, vanishes for | ze <a. By the 
Paley-Wiener(?!) theorem, ¥(u), the Fourier transform of D,f —h(x), vanishes 
for | 2e| >a whenever h(z)€G,,2. Since the Fourier transform is isometric, we 
have 


Consequently 


The right side is greater than | f(x) —D.f \3 when h(x) is different from D.f, 
which proves the theorem. It can be shown that there exists a constant a 22 
such that, for a<p< © and for 1<p<a(a—1)-', D,f is not a function of best 
approximation of f(¢). 

10. A generalization of problem I. Let C be a point set possessing the 
following properties: 

(i) There exists an integral function w= (z) which maps C on the real 
axis of the z-plane in a one-to-one correspondence. 

(ii) The inverse function z=YV(w) is continuous in (— ©, «), 


(%) J. L. Walsh, loc. cit., chapter 12. 

(*) E.g., E. Hille, loc. cit. footnote 14, §5. 

(*") Fourier transforms in the complex domain, Amer. Math. Soc. Colloquium Publications, 
vol. 19, 1934, pp. 12-13. 
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Again let C’ be a point set satisfying (i) and, instead of (ii), the weaker 


condition 
(ii)’ ¥’(w) exists and is finite for almost all w in (— «©, ©). From Theo- 


rems 3(a) and (b) we can deduce 


THEOREM 6. Let f(2) be bounded and uniformly continuous on C, or let, for 
p=1, 1/2, 1/3,---, f(2)EL,(C’), that is to say, let f(z) be measurable on C’ 
and the integral 


f | f(z) |? | dz| = f | f(Y(w)) |? | ||dw| a positive integer) 
c’ 


be finite. Then there are integral functions ha(z) such that, as a, 


(10. 1) ess. u.b. | f(z) — ha(z)| +0 
or 

— he P| dz|—> 0, 
(10.2) J - 4. 0 


respectively, and | ha(z)| <Age*!*! or <A,| 


Making use of Theorem 3(c), we can, in the general case 0<p< ©, con- 
struct integral functions /,(2) satisfying (10.2), certainly if C’ satisfies (i) and 
the following conditions: 

(ii)’ W’(w) is bounded in (— ~), 

(iii) For some r (0<rS1) and uniformly in (— ©, ), 


| — Al wi — 
For instance, taking w= ®(z) =z?—2ia,a>0, and taking C’ as the upper 
branch of the hyperbola xy=a, it will be seen that, for }S3p<@, C’ has 
the required properties, with r=1, A =(8a)"/*(4ap)—". 
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THE TRANSFORMATION OF DOUBLE INTEGRALS 


BY 
R. G. HELSEL AND T. RADO 


CHAPTER I. INTRODUCTION 
1.1. Let there be given a continuous transformation T of the form 
T: x= x(u, 2), y = y(u, 2), 
where x(u, v), y(u, v) are defined and continuous on the unit square 
So: OSusl, 
Designate the continuous image of the boundary of So under T by C. We shall 


be concerned with the validity of the topological transformation formula (ap- 
parently first studied by Schauder [1](#)) 


ff F(x(u, v), y(u, v))J(u, v; T)dudv 
(1) 
+0 +00 
-f f F(x, y)u(x, ¥; So, T)dxdy, 


where F(x, y) is any measurable function in the xy-plane; J(u, v; T) is the 
Jacobian, x,(u, v)y.(u, v) —x.(u, v)yu(u, v); and u(x, y; So, T) is equal to the 
topological index of the point (x, y) with respect to C if (x, y) is not on C; 
otherwise, u(x, y; So, T)=0. The topological index of a point (x, y) with re- 
spect to the continuous curve C is defined as follows: 


DEFINITION. As (u, v) describes the boundary of So once in the counter- 
clockwise sense, (x(u, v), y(u, v)) describes in the xy-plane the directed closed 
continuous curve C, and, tf (x, y) is a point not on C, the change of the continu- 
ously varying argument of the vector from (x, y) to (x(u, v), y(u, v)) ts of the form 
2k, where k is an integer (positive, negative, or zero). The integer k is called 
the topological index of the point (x, y) with respect to C. 


If F(x, y) =1, formula (1) becomes the area formula 


(2) f Re v; T)dudv = [of y; So, T)dxdy. 


If the transformation T is biunique, | u(x, y; So, T )| assumes only two values, 
0 or 1, and formula (1) reduces to the ordinary transformation formula 


Presented to the Society, April 24, 1943; received by the editors January 6, 1943. 
(*) Numbers in brackets refer to bibliography at end of the paper. 
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Sf (x(a, 2), | dude = Sf 


where 7(.S¢) is the image under T of S), the set of interior points of So. 

1.2. Radé and Reichelderfer have established formula (1), under the as- 
sumption that the integral on the left exists and the locus of C is a set of 
planar measure zero, for a wide class of continuous transformations which 
they call K;. (Many other results hold as well in ciass K3. See Radé and 
Reichelderfer [1].) The purpose of the present paper is to show that this re- 
sult implies all of the results on the transformation of double integrals in the 
literature of which we are aware (see McShane [1], Morrey [1], Rademacher 
[1], Radé [1, 2, 3], Schauder [1], Young [1, 2, 3]). It is not difficult to verify, 
by means of the two theorems of Radé and Reichelderfer quoted in 1.6 and 
1.7, that the transformations considered by McShane [1], Morrey [1], Rade- 
macher [1], Radé [1, 2, 3], and Schauder [1] belong to class K; and that the 
formulas established by these authors follow from formula (1). On the other 
hand, it is not apparent that Young’s results are implied by those of Radé 
and Reichelderfer. We shall devote our attention, then, to the work of Young 
[1, 2, 3], ultimately showing that all of his results can be accounted for in 
terms of the results of Radé and Reichelderfer. (It is interesting to note that 
our methods will place many of the transformations considered by Young in 
Morrey’s class L. Morrey [1] has established formula (2) for transformations 
in class L.) 

1.3. The two transformation formulas which Young develops are the same 
on the left as formulas (1) and (2); however, the right sides of Young’s 
formulas are not Lebesgue integrals, but the limits of Lebesgue integrals. 
We shall now give a brief description of these limits. Let o be the generic nota- 
tion for a subdivision of the boundary of So by points Pi, - - - , P», numbered 
consecutively in the positive sense around So. Let II, be the directed closed 
polygon “inscribed” in C which is formed by straight segments connecting 
in order the images of Pi, - - -, Pm, P: under T. Define u(x, y) to be equal 
to the topological index of the point (x, y) with respect to II, if (x, y) is not 
on II,; otherwise, set u(x, y) =0. Then if F(x, y) is any measurable function 
in the xy-plane, Young defines the integral of F(x, y) over the area of the polygon 


II, as follows: 
+00 +00 


providing the Lebesgue integral on the right exists. Young then defines the 
integral of F(x, y) over the area of the curve C in the following way: 


f F(x, y)dedy = lim (¥) f [re y)dxdy, 


‘ 


1943] TRANSFORMATION OF DOUBLE INTEGRALS 85 


providing the limit on the right exists and is independent of the manner in 
which ||o||—+0. (||o|| is the norm of the subdivision ¢.) 
1.4. For certain classes of continuous transformations 


T: x= x(u, 2), y = y(u, 2), (u, v) € So, 
Young has established the area formula 
3 J(u, 0; T)dudv = dxdy, 


where C is the image of the boundary of Sy under T and (Y)/ffcdxdy is as- 
sumed to exist. In a few instances, he also asserts that the general transforma- 
tion formula 


(4) f v), y(u, v))J(u, v0; T)dudv = (Y) y)dxdy 


holds for any measurable function F(x, y) in the xy-plane as soon as the in- 
tegral on the right exists. Throughout his work, Young assumes that the co- 
ordinate functions x(u, v) and y(u, v) of the transformation T are absolutely 
continuous on every horizontal and on every vertical in So; hence, the con- 
tinuous curve C is rectifiable and its locus is a set of planar measure zero. 

1.5. A comparison of formulas (1) and (4) leads to this engaging question: 
When does 


fr. 2, F(x, y)u(x, So, T)daxdy? 


Young neither raises nor answers this question. Instead, he always assumes 
the existence of the limit on the left and defines it to be the new integral, 
(VY) SfcF(x, y)dxdy. It is clear that an improvement is achieved if in formula 
(4) we can replace (Y){fcF(x, y)dxdy by ff F(x, y)u(x, y; So, T)dxdy. 
(Schauder [1] has already enunciated the desideratum of replacing 
(Y)ffeF(x, y)dxdy by something more tangible.) In fact, in every case 
considered by Young but one, we shall accomplish this replacement by 
showing that the transformation belongs to class K; and, hence, formula 
(1) holds as soon as the integral on the left exists (cf. 1.2, 1.4). In the one 
exceptional case just mentioned, it would seem that Young did not estab- 
lish anything (cf. 3.10). (The preceding statements should not be con- 
strued to mean that we have succeeded in identifying (Y) SfoF(x, y)dxdy 
and f. Be Ie F(x, y)u(x, y; So, T)dxdy in all the cases of Young. Actually, it 
appears to us that a general study of the relation of these integrals would lead 
to questions of great difficulty and interest.) 

1.6. To place Young’s transformations in class K3, we shall utilize the 
two following theorems of Radé and Reichelderfer [1]. 
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THEOREM. If the transformation T is continuous on the domain D of the 
uv-plane; the partial derivatives x,,(u, v), x»(u, V), Yu(U, V), Yo(u, v) exist and are 
continuous on D; and J(u, v; T) is summable on D, then TEK;(D). 


1.7. CLOSURE THEOREM. Let there be given bounded domains D and D,, in 
the uv-plane and continuous transformations 


T: «x = x(u, 0), y = y(u, (u,v) ED, 
Th: Y= Yn(u, 2), (u,v) E Dn, 


with the following properties: (i) D,CD and, for every closed set FCD, there 
exists an NMo=MNo(F) such that FCO, for all n>mo; (ii) J(u, v; T) exists a.e. 
(almost everywhere) and is summable on D; (iii) T,€ K3(D,) for n=1, 2, ; 
(iv) for every closed set FCD, x,(u, v)—, uniformly to x(u, v), yn(u, 
formly to y(u, v), on F and 


lim Sf T,) — J(u, 0; T) | dudv = 0. 


Then TEK;(D). 


(The Closure Theorem is still valid if the condition 
lim ff 1 (u, v; T,) — J(u, v; T) | dudv = 0 
n F 


is replaced by the weaker condition 


lin Sf | dudo = | 9; 7)| duds.) 


1.8. The method employed in Chapter III to place each transformation 
considered by Young in class K; is this: We approximate to the given trans- 
formation by a sequence of transformations which belong to class K; (to be- 
gin, we use transformations which belong to class K; because of 1.6) and 
satisfy the conditions of the Closure Theorem, thus placing the given trans- 
formation in class K3. The feature of this paper is the utilization of integral 
means to obtain the approximating transformations. 

The second chapter of this paper is a systematic presentation of general 
theorems in analysis and properties of integral means which we apply in 
Chapter III to prove our results. Since integral means give rise to non- 
denumerable sequences of approximating functions, many of the theorems in 
Chapter II are concerned with such sequences; however, we work only with 
denumerable sequences in Chapter III. The intrinsic interest of our results 
on non-denumerable sequences of integral means justifies their inclusion. 

1.9. The cases which we treat in Chapter III are numerous and contain 
many assumptions so we shall not list them here, but we shall summarize 
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the essential differences between our results and those of Young. 

1. By placing Young’s transformations in class K3, we do more than es- 
tablish formula (1) of 1.1 for them, since many other results hold as well for 
transformations in class Ks (cf. 1.2). 

2. In most of his cases, Young did not establish formula (4) of 1.4, but 
only the area formula (3). On the other hand, we shall establish the general 
topological transformation formula (1) for all of Young’s transformations. 
Young established formula (3) for the transformations considered in 3.6, 3.7, 
3.9, 3.12, and 3.14; formula (4) for the transformations of 3.8 and 3.14.1. 
The transformation of 3.14.1 is the most general case for which Young claims 
to have established formula (4). 

3. Throughout Chapter III we assume there is given a continuous trans- 
formation T: x =x(u, v), y=y(u, v), (u, v)ESo, which satisfies the standard 
hypothesis Ho: x(u, v) and y(u, v) are absolutely continuous in the Tonelli 
sense (cf. 2.17) on So and the image of the boundary of S» under 7 is a set 
of planar measure zero. The additional restrictions on T which we use in 
Chapter III do not differ from restrictions used by Young; however, Young 
always makes the standard assumption that x(u, v) and y(u, v) are of bounded 
variation in the Tonelli sense (cf. 2.17) on So and are absolutely continuous 
on every horizontal and on every vertical in So. Clearly our hypothesis Hy 
is less restrictive than Young’s standard assumption (cf. 1.4). 

It must be remembered that in all cases our theorems are not identical 
with those of Young (cf. 1.3, 1.5). The references to Young’s work which ap- 
pear in 3.6, 3.7, 3.8, 3.9, 3.12, 3.14, and 3.14.1 direct the reader to the theo- 
rems of Young which correspond to our theorems of those sections. 

1.10. Young’s theorems on the transformation of double integrals are not 
final in character; hence, we have attempted to find a general theorem which 
would at once account for all of his results. The following proposition, which 
we have not been able to establish or deny, seems a likely generalization (cf. 
3.15): 

If T satisfies Ho (cf. 1.9) and J(u, v; T) is summable on So, then TE K;3(S$) 
and formula (1) holds if the integral on the left exists. 


CHAPTER II. AUXILIARY THEOREMS AND INTEGRAL MEANS 


2.1. Most of the results listed in this chapter are known; however, a few 
of them are new—for example, see 2.12, 2.21, 2.21.1. Proofs are included 
whenever convenient references are not available. 

2.2. The following concept will be used extensively in the present paper: 


DEFINITION. A family F of measurable functions defined on So is said to 
have the Vitali property—briefly, property (U)—on a measurable subset E of So 
if the following condition is satisfied. To every e>0 there corresponds a = 6(e) 
>0 such that 
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<e 


if v)dudv 


for every function f(u, v) of F and every measurable set eC E with |e <6. (Ife 
is any set, |e| denotes the exterior measure of e.) 

2.2.1. CoroLiary. If a family F of functions f(u, v) has property (VU) ona 
measurable subset E of So, then the family F* of functions | f(u, v)| also has, and 
conversely. 

2.2.2. CoRoLLary. If 7 is a family of functions possessing property (VU) on 
a measurable subset E of So and G' is a family of measurable functions which are 
uniformly bounded on E, then the family of functions f(u, v)-g(u, v), f(u, v) EF 
and g(u, v) EG, has property (VU) on E. 

2.2.3. COROLLARY. If a family F of functions has property (VU) on a measur- 
able subset E of So, then {fuf(u, v)dudv exists for every function f(u, v) of F and 
these integrals are uniformly bounded. 


2.3. THEOREM. If on a measurable subset E of So, fn(u, v) 20 and f,(u, v) 
—f(u, v) a.e., a necessary and sufficient condition for f(u, v) to be summable 


on E and 
v)dudv offs v)dudv 


is that {f.(u, v)} have property (VU) on E. 


Proof. See de la Vallée Poussin [1, p. 477]. This theorem may be extended 
to a non-denumerable sequence of continuous functions f,(u, v), 0<a<1, 


such that f.(u, v)—a.of(u, v) a.e. on E. 


2.3.1. COROLLARY. If on a measurable subset E of So, fa(u, 0) —>nf(u, v) @.€., 
a necessary and sufficient condition for f(u, v) to be summable on E and 


Sf | Sn(u, 0) — f(u, v) | dudv — 0 


ts that { fru, v)} have property (VU) on E. 


2.4. Many of the properties of integral means which we shall utilize de- 
pend upon the next two theorems concerning partial differentiation of an 
indefinite integral. 


THEOREM. If f(u, v) ts summable on So and 


F(u, 9) = f f Se 


7 
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then, for every v, F,(u, v)=J,f(u, n)dn, provided u does not belong to a set of 
measure zero which is independent of v. 

Proof. See Fubini. [1]. (Similarly, for every u, F,(u, v)=Jf(é, v)dt, pro- 
vided v does not belong to a set of measure zero which is independent of x. 
A majority of the theorems of this chapter are stated in an unsymmetric 
form as regards u and v. In each such instance, a second theorem may be 
obtained by interchanging the réles of u and v.) 


2.5. THEOREM. If f(u, v) ts summable on So and 
0) = f #0, oda, (u, 2) E So, 
0 


(for a.e. v, g(u, v) exists for every u) then g,(u, v) =f(u, v) a.e. on So. 


Proof. See Helsel and Young [1]. 
2.6. We shall have occasion to employ the following types of integral 
means. 


DEFINITION. If f(u, ») ts summable on So and 0<h<1/2 is yjixed, then 


h 1 h A 1 uth oth 
fiw.) = +a, 9+ = ff 
defined on the square S,: hSus1i-—h, hSvs1-—h, is called the h-h-integral 
mean of f(u, v). 


DEFINITION. If f(u, v) is summable on Sy and 0<h<1/2 ts fixed, then 


uth 


1 a 1 
9) =f fut = = f 

defined for a.e. v on the rectangle Ryo: h&usi—h, OSv31, ts called the h-in- 
tegral mean of f(u, v). 

2.7. THEOREM. If f(u, v) is continuous on Sy and R (a closed oriented rec- 
tangle) comprised in Sis fixed, then, as h-0, f,(u, v) uniformly to f(u,v) on R, 

Proof. Well known properties of integral means such as this are estab- 
lished in Bray [1], Morrey [1], and Radé [2]. When theorems on integral 
means are stated without proofs or references, the reader should consult the 
above papers. 


2.8. THEOREM. If f(u, v) is continuous on So, then on Sy, 


wth 
{f(u + h, 1) — f(u — h, 2)}dn. 


v—h 


ou 
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2.9. THEOREM. If f(u, v) is summable on-So, then f,(u, 0)—na.of(u, v) a.e. 
on So. 


Proof. The result is immediate from Lebesgue’s theorem on absolutely 
continuous set functions. See de la Vallée Poussin [1]. 


2.10. THEorEM. If f(u, v) is summable on So and RCS is fixed, then 
{fr (u, v)} has property (UV) on R. 


Proof. Let e be any measurable set comprised in R. Then for hk small 
enough that RC.S;, we have 


sa { f fu + a, 0+ 6)| dads 
| dude} dad 


So, |e|)dadp = |, So, | 


f v)dudv 


where 
e= E [(u — a, » — 6) Eel, 


(u,v) 


(| f|, 50, | e|) = SS | 2) | dudo. 


Because of the absolute continuity of SSelf(u, v) | dudv as a set function on So, 
2(|f|, So, |e|)—> 1-00; hence, {fR(u, v)} has property (UV) on R. 


2.11. THEOREM. If f(u, v) and g(u, v) belong to associated Lebesgue classes 
L» and L* respectively, where p>0, g>0, 1/p+1/q=1, and if RCS$ is fixed, 
then {fx(u, v)-gh(u, v)} has property (UV) on R. 

Proof. The result follows from Hélder’s inequality and reasoning similar 
to that employed in 2.10. 

2.12. THEOREM. If f(u, v) is summable on So and the closed interval (a, b) 
is strictly interior to (0, 1), then 


b 
f | 9)| du —> f | f(u, v) | du, 


provided v does not belong to a set of measure zero which is independent of a and b. 


Proof. By 2.9, fh(u, v)—nr.of(u, v) a.e. on So; hence from the Theorem of 
Fubini, f,(u, v)—n».of(u, v) a.e. on a.e, line v=constant in So. Let E} be the 
exceptional set of v values. Also, by the Theorem of Fubini, fal f(u, v)| du 


= 
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exists for v not belonging to an exceptional set E? of measure zero. 
Choose two rational numbers a and 8 such that 0<a<8<1. Then define 


g(v) = f | f(u, ») | du, v not € 


where 6>0 is rational and OSa—6<8+531. Since g(v) is summable on 
0<v3X1, it follows from Lebesgue’s theorem on absolutely continuous set 
functions that 


1 
+ n)dn —— g(0), 
2h 


for v not belonging to an exceptional set E%(a, 8, 5) of measure zero. Put 


E,=E+E?+ E} (a, B, 6). 


a,B,8 rat. 


Then | Z,| =0. 
For v notC EZ, and for a fixed rational 6>h, 


[Aw f fff [fu + & | duh dean 
“| 9+ du} ae 


h 
Therefore 
lim sup f | fa(u, v) | du S g(v) = f | f(s, v) | du, vnot € E,. 
a 


As 6—0 through positive rational values, we obtain 


8 
lim sup ff au | 0) | du, vnot € £,. 


This inequality holds for any two real numbers 0 <a <b <1, since forv not CE, 


lim sup au stim sup a 

Bn b 
sf | »)| du f | f(u, v) | du, 


where {a,} and {8,} are sequences of rational numbers converging to a and b 
respectively and satisfying 0<a,<a<b<6,<1. 


= 
Fy 
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By the Lemma of Fatou, for any two real numbers 0<a<b<1 


-f lim inf | fa(m, 0) | de 
a a h-0 
(2) 
lim inf f | 2) | du, E, + 


The result follows from (1) and (2). 


2.13. Lemma. On asusb, let there be given a family of functions f.(u), 
0<a<1, which converge a.e. to f(u) as a—0. If, for 0<a<1 and asus, 
|fa(u)| Sg(u), where g(u) is summable on aSuSb, then J. f(u)du. 


Proof. Deny the truth of the assertion. Then there exists a denumerable 
subsequence of functions {f.,(u) } such that f.,(u)—af(u) a.e. on aSu<b and 
sf ‘fa,(u)du does not converge to f ‘f{(u)du. This contradicts Lebesgue’s theo- 
rem on termwise integration. 


2.14. THEOREM. Jf f(u, v) 20 is summable on So, \(u) 20 is summable on 
O0susil, and V(u)= Sf(u, v)dv is a bounded function of u (on the set where it 
exists), then for a fixed rectangle R: 0<asusb<i1, 0<csvsd<1i, 

ff ACH ff v)d(u)dudo. 
R add R 
Proof. Choose h small enough that RC S,. By 2.12, for a.e. u, asusb, 


d d 
Also 


d(u) [few v)dv = f +a,0+ Ps} dd 


< 
4h? 


f f V(u + a)dadB M-d(u), 


where M is a bound jor V(u). By the lemma of 2.13 


Hence, by the Theorem of Tonelli, 


f f falu, ——> f f f(u, v)d(u)dudo. 
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2.15. THEOREM. If f(u, v) is continuous on So, then on Rio (cf. 26) 


Ou h 


2.16. THEOREM. If f(u, v) is summable on So, then f,(u, v)—>r0f(u, v) a.e. 
on So. 


Proof. The result is immediate from 2.5. 

2.17. For the purpose of Chapter III, we shall need certain theorems per- 
taining to functions which are absolutely continuous in the sense of Tonelli— 
briefly, A.C.T.—and to their integral means. 


DEFINITION. The function f(u, v) is A.C.T. on So if (i) f(u, v) ts continuous; 
(ii) f(u, v) ts of bounded variation in the sense of Tonelli; (iii) for a.e. v=n, 
0<n31, the function f(u, n) ts absolutely coniinuous in u, and, for a.e. u=é, 
OSES1, the function f(E, v) is absolutely continuous in v. Denote by V(n; f) the 
total variation of f(u, n) as a function of u on OSux1; denote by V(E; f) the 
total variation of f(§, v) as a function of v on OSv3S1. Then f(u, v) is of bounded 
variation in the sense of Tonelli if V(n; f) and V(é; f) are summable functions 
of n and & respectively on the interval (0, 1). 


2.18. If f(u, v) is A.C.T. on So, the partial derivatives f,,(u, v) and f,(u, v) 
exist a.e. and are summable on So; moreover, for a.e. v, V(v; f) = So | Sulu, v) | ou 
and, for a.e. u, V(u;f)=J, | fo(u, v)| dv (see Morrey [1]). 


2.19. THEOREM. If f(u,v) is A.C.T. on So, then Of, (u, v)/du = (Of(u, v)/du), 
on Sy. 


2.19.1. COROLLARY. If f(u,v) is A.C.T. on So, then Of, (u,v) /Ou—n.0df (u,v) 
a.e. on So. 


Proof. The result is immediate from 2.19 and 2.9. 


2.20. THEOREM. If f(u, v) is A.C.T. on So, then Of,(u, v)/du = (Of(u, v)/du)s 
on Rio providing v does not belong to a set of measure zero which is independent 
of h. 

Proof. By 2.15, Ofs(u, v)—f(u—h, v)} on Reo 
Since f(u, v) is A.C.T., it is absolutely continuous in wu for a.e. v, OSvS1. 
Fix a value of v for which f(x, v) is absolutely continuous in u. Then 


Ofn(u, 2) 


Ou 


= = {fut hs) = 


= 


94 R. G. HELSEL AND T. RADO* [July 


2.20.1. COROLLARY. If f(u,v) is A.C.T. on So, then Of, (u,v) /Ou—> .00f (u,v) /du 
a.e. on So. 


Proof. The result is immediate from 2.20 and 2.16. 


2.21. THEOREM. If f(u,v) is A.C.T. on So, then Of,(u, v)/dv = (Of(u, v)/Ov) 
on Rio providing v does not belong to a set of measure zero which is independent 
of h. 


Proof. For 0sv31, 


1 h 
{f(uta,») —f(u+a,0)}da 


= 0) — 0). 


Therefore, on Ryo, 


a) 
Sn(u, v) = fr(u, 0) + =f on dédn. 


By 2.4, 


Ofn(u, v) af(é, 2) dé 
ov 2h u—h dv dv h 


on Ryo providing v does not belong to a set of measure zero which is inde- 
pendent of h. 


2.21.1. CoROLLARY. If f(u, v) is A.C.T. on So, then Of,(u, v)/Ov— 00 f(u, v) /dv 
a.e. on So. 


2.22. THEOREM. If f(u, v) is A.C.T. om So, then fi(u, v) is A.C.T. on Rio. 


Proof. Because of the absolute continuity of the indefinite Lebesgue inte- 
gral, f,(u, v) is absolutely continuous in u for every v, 0Sv<1; hence, 


fr) = f 


1 
| fu + hy 2) — flu — 0) | 


Ofn(u, v) du 


where M is a bound for | f(u, v)| on So. Thus V(v; fy) is summable on 0Sv<1. 
From the proof of 2.21, on Rio, 


Sr(u, = fr(u, 0) + ”) at} an 


—h 


‘ 
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Therefore, f,(u, v) is absolutely continuous in v for every u, hSus1—h, and 


V(u; fr) -f 


f 2) | dd 


Thus V(u; f,) is summable on hSu<1—A and the proof is complete. 


1 


of 
Ov 


IA 


CHAPTER III. TRANSFORMATION FORMULAS FOR DOUBLE INTEGRALS 


3.1. In this final chapter we shall take each transformation for which 
W. H. Young established either formula (3) or (4) of 1.4 and, by placing it in 
class K3, show that formula (1) of 1.1 holds if the integral on the left exists 
(cf. 1.2). Throughout the chapter we make the assumption that there is given 
a transformation 


T: x = x(u, 2), y = y(u, 2), (u, v) € So, 


which satisfies hypothesis Hy of 1.9. Additional restrictions on the trans- 
formation T will be stated in the sequel. In the last three theorems of this 
chapter (cf. 3.9, 3.12, 3.14), the conditions imposed on the coordinate func- 
tions x(u, v) and y(u, v) are not symmetric. A second theorem may be ob- 
tained in each of these cases by interchanging the réles of x(u, v) and y(u, v). 

3.2. We shall now show that certain transformations obtained from T by 
approximating to the functions x(u, v), y(u, v) with h-h-integral means belong 
to class K;. First we introduce the following notations: 


nf (te, = Sisal v), nf(u, v) = v), = = 3, 4, 
where f(u, v) is any summable function defined on So. 


3.3. LEMMA. If T satisfies Ho, the transformation 


Tm: 2), n,m > 2, (u, 2) mS, 
belongs to class Kz on »S°-mS°. 


Proof. By 2.8, v)/du, Aix(u, v)/dv, ORy(u, v)/du, v)/dv are 
continuous on »S-mS; hence, "J"=J(u, v; ;,7") is summable there. By 1.6, 
mS). 


3.3.1. COROLLARY. If T satisfies Ho, the transformation 


ala: y= 0), n > 2, (u, 0) Ew, 
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belongs to class Kz on ,,S°. 


3.3.2. COROLLARY. If T satisfies Ho, the transformation ®T™ (1/n+1/m <1/2) 
belongs to class Ks on S¢jnstjm- 


Proof. From the definition of class K;(D), if TEK;(D) and D* is any 
domain within D, then TE K;(D*). See Radé and Reichelderfer [1]. 


3.4. LemMA. If T satisfies Ho, the transformation 
"a(t, t), y = y(u, 2), n> 2, (u,v) 
belongs to class K; on ,,S°. 


Proof. Fix a value of »>2. For each positive integer m, satisfying 
1/n+1/m<1/2, define the transformation {7% on Si/n41jm. We shall verify 
that the transformations ;7 and 27% satisfy the conditions of the Closure 
Theorem (cf. 1.7). - 

(i) Clearly S?/n+1/mCnS° and, for every closed set FC,,S°, there exists an 
mo=m»(F) such that for all m>mo. 

(ii) By 2.8, Otx(u, v)/Ou and O%x(u, v)/dv are continuous on ,,.S and, there- 
fore, bounded. The summability of "J =J(u, v; 37) on ,S° follows then from 
the summability of y,(u, v) and y,(u, v) on So (cf. 2.18). 

(iii) By 3.3.2, for 1/n+1/m<1/2. 

(iv) For every closed set FC,,S° there exists a closed oriented rectangle R 
such that FCRC,S®. By 2.7, ®y(u, v)—,uniformly to y(u, v) on R. To show 
that "J” converges strongly to "J on F, we shall verify that "J”—,, "J a.e. 
on R and that {"J"} has property (UV) on R for fixed m; hence, by 2.3.1, 
SSe\"J"—"J| dudv—,,0. 

It follows directly from 2.19.1 that "J"—,, "J a.e. on R. To show property 
(VU) for {"J"} on R, it suffices to consider the product 


au 


since the other product of "J" may be handled in the same way. By 2.8, the 
first factor is bounded for fixed m and, by 2.10, {m™(dy(u, v)/dv)} has property- 
(V).on R; hence, from 2.2.2, { Ohx(u, v)/du-Ony(u, v)/dv} has property (V) on 
R for fixed n. 

The conditions of the Closure Theorem being satisfied, we conclude that 
nl €K;(,S°). 

Exchanging the réles of x(u, v) and y(u, v), we have the lemma: 


3.4.1. Lemma. If T satisfies Ho, the transformation 


x= 2x(u,0), y= 2), > 2, (u, ») oS, 
belongs to class Ky on ,,S°. 
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3.5. Our investigation of the transformations considered by Young will 
‘proceed according to the following scheme: First we approximate to the given 
transformation T by a suitable one of the transformations 477, ;7, T;. Then, 
as the reader will easily verify, all of the conditions of the Closure Theorem 
are satisfied except strong convergence of the Jacobians. Since the Jacobians 
of the approximating transformations converge a.e. to J(u, v; T) (cf. 2.19.1), 
a necessary and sufficient condition for strong convergence is that the Jaco- 
bians of the approximating transformations have property (V) (cf. 2.2, 2.3.1). 
We shall now investigate property (UV) for the Jacobians of the approximating 
transformations. 


3.6. THEOREM 1 (see Young [2, pp. 80-85], cf. 1.9). If T satisfies Ho and 
one factor in each product x,(u, v)-yo(u, 0), X»(u, V)-yu(u, v) ts bounded on So, 
then T€K3(S$) and formula (1) holds whenever the integral on the left exists. 


Proof. Consider any fixed RC.S$. Choose n large enough that RC, S. Ap- 
proximate to T by 27%. To show property (VU) for {"J"} on R, it is sufficient 
to consider the product 


Ou n( 0))-n(Yo( ty 


since the other product in "J" may be handled in the same way. By assump- 
tion, one factor of x,(u, v)-y.(u, v), say y»(u, v), is bounded on So; hence, 
*(y»(u, v)) is uniformly bounded on R. By 2.10, {%(x.(u, v))} has property (VU) 
on R. Therefore, from 2.2.2, { Onx(u, v)/Ou-Ony(u, v)/dv} has property (VU) on 
R. By the Closure Theorem, T€K3(5S¢) and, by the result of Radé and 
Reichelderfer mentioned in 1.2, the theorem follows. 


3.7. THEOREM 2 (see Young [2, pp. 80-85], cf. 1.9). If T satisfies Hy and 
in each product x,(u, v)-yo(u, 0), V)-Yu(u, v), the factors belong to associated 
Lebesgue classes on So, then TEK;(Sq) and formula (1) holds providing the 
integral on the left exists. 


Proof. Consider any fixed RC.S$. Choose m large enough that RC, S. Ap- 
proximate to T by s7s. On R 


It now follows from 2.11 and 2:2.1 that {"J"} has property (VU) on R. By the 
Closure Theorem, T€K;3(5SQ); hence, the theorem follows. 


3.8. THEOREM 3 (see Young [3, p. 163], cf. 1.9). If T satisfies Ho; |xu(u, »)|, 
| yu(u, v)| SA(u) and |xo(u, v)|, |ye(u, Su(v), where d(u) and are sum- 
mable on (0, 1), then TE K3(S) and formula (1) holds as soon as the integral 
on the left exists. 
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Proof. Consider any fixed RC.Sj. Choose large enough that RC,S. Ap- 
proximate to T by :7. To show property (VU) for {"J} on R, it suffices to 
consider the product dpx(u, v)/du-dy(u, v)/dv since the other product of "J 
may be handled in the same way. Let e be any measurable set comprised in R. 
Then 


f f 2) | dy(u, 2) 
Ou ov 
2 1/ l/n 
< { f f | tu(u + a, » + B)|- |yo(u, dude} dadB 


2 


n n 

4 
n 

n 


ff 


n? l/n l/n 
f So, | e| dad 


S QA(u)u(v), So, | e| ). 
Q has been defined in 2.10 and 
E [(u—a, 
(u, 


By the Theorem of Tonelli, \(u)u(v) is summable on So; hence, 2(A(u)u(v), So, 
|e| Thus, by the last inequality and 2.2.1, { ORx(u, v)/du-dy(u, v)/dv} 
has property (VU) on R. Finally, by the Closure Theorem, TE K3(S8) and the 
theorem follows. 


3.9. THEOREM 4 (cf. 3.1). If T satisfies Ho; | v)| SX(u), | v)| 
Sp(v), where \(u) and p(v) are summable on (0, 1); and the total variations 
V(v; y) and V(u; y) are bounded on (0, 1), then TEK3(S$) and formula (1) 
holds whenever the integral on the left exists. 


Proof. Consider any fixed RCS}. Choose m large enough that RC,S. Ap- 
proximate to T by T,. On R, | J*| Sd(u)-2| yo(u, v)| +u(v)-2| yu(u, v)|. This 
inequality, along with 2.14, 2.3, and 2.2.1, implies property (VU) for {J*} 
on R. By the Closure Theorem, TE K;(5S$), so the theorem follows. 

3.10. Young really does not discuss a situation corresponding to Theo- 
rem 4. Instead, he claims (see Young [2, p. 88]) to have established formula 
(3) of 1.4 under the conditions: x(u, v) and y(u, v) are absolutely continuous 
on every horizontal and on every vertical in So; yu(u, v), yo(u, v) are summable 
on So; | v)| SX(u) and | v)| Sp(v), where X(u), u(v) are summable 
on (0, 1) and A(u)y,(u, v), u(v)yu(", 9) are summable on So. In attempting to 
prove this assertion, Young erroneously assumes that y(u, v) can be written 
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as follows: y(u, v) =y:(u, v) —ye(u, v), where y:(u, v) and ye(u, v) possess non- 
negative first partial derivatives in addition to all of the properties of y(u, v) 
(see Young [2, p. 91]). Even if y(u, v) could be split in this manner, V(0; y) 
and V(u;y) would be bounded (as we assumed in Theorem 4) since 


1 dyi(u, * 


0 
S yi(1, 0) — yi(0, v) + yo(1, — y2(0, 0) 
and, similarly, V(u; y)Syi(u, 1)— yi(u, 0)+y2(u, 1) — yo(u, 0). 


3.11. For the purpose of the next theorem, we require the following 
lemma: 


Lemma. If f(u, v) 20 and g(u, v) 20 are summable on So and if 0<6<1/2 
ts fixed, then 
B) Sf, +a,v+ B)g(u, v)dudo 


exists a.e. and is summable in r: —isasb, —éiSBS5. 
Proof. See Young [2, p. 91]. 


3.12. THEOREM 5 (see Young [2, pp. 92-93]; cf. 1.9, 3.1). For T satisfying 
Hy and 0<5<1/2 fixed, define 


T(a, 8:0,0): «= 2(u+a,v0+ 8), y= y(u,»), (a, 8) Er, (u, 2) Se. 
Then for a.e. (a, 8) in r, T(a, B; 0, 0) EKs(S$) and the transformation formula 


f F(x(u + a, » +B), 9; B; 0, 0))dudo 
+00 +00 
f f F(x, y)-p(, ¥; Ss, T(a, B; 0, 0))dady 


holds if the integral on the left exists. 


Proof. Consider any fixed RC.S$. Choose mo large enough that RC Ss+1/. 
for n>mo. Then for (a, 8)Er and n>MNo, define 


B; 0,0): x= y= y(u,»), (u, 0) © 


We wish to show that { J(u, v;27(a, 8; 0, 0)} has property (VU) on R for a.e. 
(a, B) in r. To this end, we need only consider the product d3x(u-++a, v+8)/du 
-Oy(u, v)/dv since the other product of the Jacobian may be handled in the 
same way. 


For (a, 8) Er, define 
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G(u, v, B) =| xu(u +a,0+8)|-| (4,2) ES: 
and 
G,(u, v, a, B) = al + a,v + 6) | | yo(u, v) |, n> No, (u, 2) 
By 3.11, 
H ’ = G » VU, a, dud 
(a, B) (u, v, a, B)dudv 


exists a.e. and is summable on r. Let E be the exceptional set of points (a, 8) 
in r. From 2.9, 


(1) "H(a, 8) = Jf v, a, B)dudv fc v, a, B)dudy = F(a, B) 


for a.e. (a, 8)Er. Let Ey be the exceptional set of points (a, 8) in r. Con- 
sider a fixed point (ao, Bo) in r and not belonging to Ei+£2. By 2.9, 
G,,(u, ¥, Bo)—>n G(u, v, Bo) a.e. on R; hence, it follows from (1) and 2.3 
that {G,(u, v, ao, Bo)} has property (UV) on R. For (u, v)ER and n>mo, 


+ ao, 0 + Bo) 
Ou 


| Oy(u, v) 
dv 


G,(u, v, Qo, Bo) 


therefore, { v+Bo)/du-dy(u, v) /av} has property (VU) on R. 
Finally, by the Closure Theorem, T(a, 8; 0, 0) belongs to class K; on Sf for 
a.e. (a, 8) in r. 

To complete the theorem, we must show that for a.e. (a, 8) in r the image 
of the boundary of S; under T(a, 8; 0, 0) is a set of planar measure zero. 
Since x(u, v) is A.C.T. on So, for a.e. (a, 8) in r the function x(u+a, v+ 8) is 
absolutely continuous on the boundary of S;; hence, for such (a, 8), the con- 
tinuous image of the boundary of S; under T(a, 8; 0, 0) is a semi-rectifiable 
curve and its locus is a set of planar measure zero. 

(For the proof of Theorem 5 we need not assume that the image of the 
boundary of So under T is a set of planar measure zero.) 

3.13. To prove the final theorem, we approximate to the given trans- 
formation T by the transformation 


v), ny(u, v), n> 2, (u, Rijn 0- 


Thus, to utilize the Closure Theorem, we must establish the lemma: 
Lemma. If T satisfies Ho, the transformation ,T, belongs to class Ky on Rin o. 


Proof. By 2.22, ,x(u, v) and ,y(u, v) are A.C.T. on Ry» o. From 2.15, 
0,x(u, v)/du and @,y(u, v)/du are continuous and, therefore, bounded on 
Rijn o. The result now follows from reasoning similar to that employed in 3.6. 


. 
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3.14. THEOREM 6 (see Young [1, pp. 365-366; 2, p. 87]; cf. 1.9, 3.1). If T 
satisfies Ho and | x.(u, v)|, | yo(y, v)| Su(v), where u(v)xu(u, v) and u(v)yu(u, 2) 
are summable on So, then TG K3(Sq) and formula (1) holds providing the integral 
on the left exists. 


Proof. Consider any fixed RCS. Choose large enough that RCRi/» o. 
Approximate to T by ,7,. To show property (V) for { J(u, v; aT 2)} on R, it 
suffices to consider the product 0,x(u, v)/0u-0,y(u, v)/dv since the other prod- 
uct of the Jacobian may be handled in the same way. Let e be any measurable 
set comprised in R. Then 


SY, 


0ny(u, 2) 
dv 


< tu(u + a,2)| -| yo(u +B, dude dad 
< f f | + a, dude} dads 


2 l/n l/n 
< f f | Xu(u, ») | audo\ 


< f f (u(r) | So, | 


dudv 


0,x(u, 2) 
Ou 


MAu(v) | xu(u, So, ), 


where Q2(u(v) | v)|, So, | e| )—> 161.00. Thus, by the last inequality and 
2.2.1, {Anx(u, v)/du-dny(u, v)/dv} has property (V) on R. 

From 2.20.1 and 2.21.1, J(u, v; »2T.)—n J(u, v; T) a.e. on R; hence, as the 
reader can now easily verify, all of the conditions of the Closure Theorem are 
fulfilled. Therefore, T€K3(S$) and the theorem follows. 


3.14.1. COROLLARY (see Young [3, pp. 189-190], cf. 1.9). If T satisfies Ho; 
|xo(u, v)|, |yo(u, v)| Su(v), where u(v) is summable on (0, 1); and the total 
variations V(v; x) and V(v; y) are bounded on (0, 1), then TE K3(S) and for- 
mula (1) holds as soon as the integral on the left exists. 


Proof. Because of the Theorem of Tonelli, the relation 


{ 2) | au\ ao = f wove: x)dv 


implies the summability of u(v)x.(u, v) on So, Likewise, u(v)y.u(u, v) is sum- 
mable on So. 
3.15. The preceding six theorems account for all of Young’s results on the 


| : q 
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transformation of double integrals. Theorem 5 seems to offer a method of 


approach to the conjectured proposition (cf. 1.10): 
If T satisfies Hy and J(u, v; T) is summable on So, then TE K3(So) and 


formula (1) holds if the integral on the left exists. 

Along this line, we have considered the question of deducing Theorems 1, 
2, 3, 4, and 6 from Theorem 5. The first four theorems follow readily from the 
fifth, but we have not been able to deduce Theorem 6 from Theorem 5. 
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A THEORY OF CROSSED CHARACTERS 


BY 
REINHOLD BAER 


The characters of the finite group(") G in the cyclic group E of order(?) 
m are just the multiplicative, single-valued G to E functions. To define 
crossed characters we have to consider apart from G and £ a homomorphism 
C of G into the group of automorphisms of E; and we denote the map of the 
element g in G under C also by g. A single-valued G to E function f(g) is 
termed a C-character (or a crossed character) of G, if it is a solution of the 
following functional equation (*): f(uv)=f(u)*f(v) for u and v in G. The 
C-characters, like the ordinary characters of G, form a finite abelian group, 
the C-character group(*) of G; and as in the classical theory of characters 
one aims at establishing a duality between G and its C-character group. 
But here the similarity ends. For in the classical theory the following three 
properties are equivalent: (a) 1 is the only element in G which is mapped 
upon 1 by every character of G; (b) it is possible to establish a duality be- 
tween G and its character group; (c) G and its character group are isomorphic. 
But if one substitutes in these properties C-characters for the ordinary char- 
acters, no such equivalence holds. Though still (c) implies (b) and (b) im- 
plies (a), only a few of the groups satisfying (a) meet the requirement (b); 
and only rarely a group satisfying (b) may be shown to satisfy (c). 


Presented to the Society April 24, 1943; received by the editors December 21, 1942. 

(4) Only finite groups will be considered in this investigation; for this reason we shall use 
the term “group” always in the sense of “finite group.” 

(?) It will be apparent immediately why it is not advisable to consider some “absolute” 
value group like the group of all the roots of unity. 

(3) Apparently the first to investigate this functional equation was I. Schur. Recently 
functions meeting this requirement have been termed crossed characters, though it might have 
been more in conformity with accepted usage to term them “crossed homomorphisms” and to 
reserve the term “crossed character” for the crossed homomorphisms with values in a cyclic 
group. For investigations of this functional equation; see I. Schur, Uber die Darstellungen der 
endlichen Gruppen durch gebrochene lineare Substitutionen, J. Reine Angew. Math. vol. 127 
(1904) pp. 20-50; I. Schur, Bemerkungen zu der vorstehenden Arbeit des Herrn Speiser, Math. 
Zeit. vol. 5 (1919) pp. 7-10; A. Speiser, Zahlentheoretische Sdize aus der Gruppentheorie, Math. 
Zeit. vol. 5 (1919) pp. 1-6; R. Baer, Automorphismen von Erweiterungsgruppen, Actualités 
Scientifiques et Industrielles no. 205, 1935; S. MacLane and O. F. G. Schilling, Normal algebraic 
number fields, Trans. Amer. Math. Soc. vol. 50 (1941) pp. 295-384; A. H. Clifford and S. 
MacLane, Factor sets of a group in tts abstract unit group, Trans. Amer. Math. Soc. vol. 50 
(1941) pp. 385-406; S. MacLane and O. F. G. Schilling, A general Kummer theory for function 
fields, Duke Math. J. vol. 9 (1942) pp. 125-167. 

(*) The C-character group of the Galois group of an algebraic extension is of importance in 
the theory of radical extensions as will be shown elsewhere. 
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In Chapter III of this investigation we characterize those groups G and 
homomorphisms C which meet the requirement (a); and those satisfying con- 
ditions (b) or (c) are determined in Chapter V. In Chapter II the order of 
the C-character group is evaluated and conditions for equality of the orders 
of G and of its C-character group are derived; this latter property is later 
seen to be intermediate between (b) and (c). Chapter IV is devoted to a char- 
acterization of the groups G and homomorphisms C satisfying the following 
postulate: the C-character group of G is the only group T of C-characters of G 
such that 1 is mapped upon 1 by every C-character in T. This property stands 
between (a) and (b) and is important by its applications to the theory of 
radical extensions of fields(°). 


CHAPTER I. PRELIMINARIES 


I. 1. The classical theory of characters. In this section we give a resumé 
of the classical theory of characters of finite abelian groups in the form best 
suited to our purposes; and we indicate how to derive these statements from 
the theorems customarily found in the various texts(°). 

If G is a finite (not necessarily abelian) group, and if E is a cyclic group 
of order m, then a character of G in E is a single-valued G to E function f(g) 
satisfying the functional equation f(g)f(k) =f(gh) for g and h in G; in short, 
a character of G in E is ahhomomorphism of the group G into the cyclic group 
E. If f’ and f’’ are characters of G in E, then f’(g)f’’(g) is a character of G » 
in E which is called the product f’f’’ of these characters. It is readily seen 
that the system of all the characters of G in E£ is a finite abelian group, the 
character group of Gin E. 

Basic for many constructions of characters is the following extension prin- 
ciple. 

Lemma I.1.1. (a) If Sis a normal subgroup of the group G, and if f is a char- 
acter of the quotient group G/S in E, then there exists one and only one character 
of Gin E which maps S upon 1 and which induces f in G/S. 

(b) If G is an abelian group the orders of whose elements are divisors of m, 
if f is a character of the subgroup S of G in E, then there exists a character of G 
in E which induces f in S. 


We denote by G’ the commutator subgroup of G and by G™ the subgroup 
of G which is generated by all the mth powers of elements in G. 


THEOREM 1.1.2. G’G" is the subgroup of all the elements in G which are 
mapped upon 1 by every character of G in E. 


To prove this theorem one notes first that the characters of G in E are 


(*) This application of the present theory will be treated elsewhere. 
(*) For example, E. Hecke, Theorie der algebraischen Zahlen, Leipzig, 1923, in particular §10, 
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homomorphisms of the group G into a cyclic group of order m, and that homo- 
morphisms of this kind map both G’ and G” upon 1; and one deduces from 
Lemma I.1.1 that there exists to every element not in G’G” a character of G 
in E which does not map it upon 1. 


THEOREM I.1.3. If G is a finite group and E a cyclic group of order m, then 
each of the following properties implies the others: 

(1) i és the only element in G which is mapped upon 1 by every character 
of Gin E. 

(2) Gis an abelian group the orders of whose elements are divisors of m. 

(3) Gand its character group in E are isomorphic groups. 


The equivalence of (1) and (2) is essentially a restatement of Theorem 
1.1.2. Property (2) is a consequence of (3), since the character group of G 
in E is abelian, and since the mth power of every character of G in E is 1. 
That (2) implies (3) is a classical theorem(’). 

If S is a subgroup of G, then we denote by K(S) the group of all the char- 
acters f of G in E which map S upon 1, that is, K(S) = {f(S) = 1}; and if T 
is a group of characters, then we denote by G(T) the set of all the ele- 
ments g in G which are mapped upon 1 by every character in 7, that is, 
G(T) = {T(g) =1}.- 


THEOREM I.1.4(8). If G is a finite abelian group the orders of whose elements 
are divisors of m, tf E ts a cyclic group of order m, then 

(a) G(K(S))=S for every subgroup S of G; 

(b) K(G(T))=T for every group T of characters of G in E; 

(c) mapping the subgroup S of G upon K(S) constitutes a duality(*) of G 
upon the character group of G in E. 


RemaRk. If T is a group of characters such that G(T) =1, then every char- 
acter of G is contained in T. 

_ I. 2. Fundamental concepts. We are concerned with a cyclic group E 
of order m and a homomorphism C of the finite group G into the group of 
automorphisms of E. We shall use the same symbol g for the element g in the 
group G and the automorphism which corresponds to g under C, or as we shall 
say, the automorphism of E induced by the element g in G. Every auto- 
morphism of E consists in raising every element in E to a fixed power which 
is prime to m and which is uniquely determined modulo m; this integer 
(modulo m) which corresponds to the automorphism induced by the element 
g in G also shall be denoted by g. 

The elements in G which are mapped upon the identity automorphism of 


(7) See, for example, Hecke, op. cit. p. 36. 

(*) See, for example, Hecke, op. cit. pp. 36-38. 

(*) A duality is a monotone decreasing, 1:1 correspondence between the elements in two 
partially ordered sets. 
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E by C form a normal subgroup Gc of G; and G/G¢ is essentially the same as a 
group of automorphisms of the cyclic group EZ. This implies in particular the 
commutativity of G/Ge. 

It will be convenient to denote by Cp the homomorphism of G which maps 
every element in G upon the identity automorphism of E; clearly Cy is char- 
acterized by Gc,=G. 

The cyclic group E of order m=[],p™°» is the direct product of (uniquely 
determined) cyclic groups E(p) of prime power order p™‘”). Every automor- 
phism of E induces an automorphism in every E(p). If p is some prime, then 
we denote by Cp the homomorphism of G which maps the element g in G 
upon the automorphism of E(p) which the automorphism g of E induces in 
E(p). It is readily seen that Ge is the cross-cut of all the Ge». 

A C-character of G (in E) is a single-valued G to E function f(g) which satis- 
fies the functional equation f(gh)=f(g)*f(h) for g and h in G. Clearly the 
Co-characters of G are just the ordinary characters of G in E which we dis- 
cussed in I.1. 

If f’ and f’’ are C-characters of G, then f’(g)f’’(g) may be verified to be a 
C-character too; and this C-character of G is termed the product f’f’’. The 
set of all C-characters of G thus forms a finite abelian group, the C-character 
group L=L(G, C) of G; and the orders of the C-characters of G are divisors 
of the order m of E. 

The order of a C-character f of G is a power of the prime number if, 
and only if, f is at the same time a Cp-character of G, since f is of order a power 
of p if, and only if, f maps G upon a subgroup of E(p). 

If v is any element in EZ, then f(g) =v'~? for g in G is a C-character of G, 
since = Such C-characters of G have been termed 
unit or principal characters ; and the group of all these characters may be called 
the principal genus. It is a cyclic group which is generated by the unit char- 
acter e'~* for e a generator of the cyclic group E£. 


Lemna I.2.1. If f is a C-character of G, and if G(f) ts the set of all the ele- 
ments g in G which satisfy f(g) =1, then (a) G(f) is a subgroup of G, and (b) 
f(g) =f(A) if, and only if, is in G(f). 


REMARK. It is easy to construct examples of C-characters f such that G(f) 
is not a normal subgroup of G. 

Proof. If g and h are both in G(f), then f(gh) =f(g)*f(k) =1 so that gh also 
belongs to G(f), proving (a). From 1=f(1) =f(sh-) =f(h)*'f(h-") one infers 
that f(gh-) =f(g)* and this makes contention (b) 
evident. 


Lemma I.2.2. If the automorphism a of the group G has the property that g 
and g* are mapped by C upon the same automorphism of E, then the function 
f*(x) =f(x*) is a C-character of G whenever f(x) is a C-character of G; and in this 


1943] THEORY OF CROSSED CHARACTERS 107 


fashion the automorphism a of G induces an automorphism a of the C-character 
group of G. 


REMARK. The inner automorphisms of G meet the requirement imposed 
upon the automorphism @ in the lemma, since the group of automorphisms 
of the cyclic group E is abelian. 

Proof. If f is a C-character of G, and if g and g* induce the same auto- 
morphism of E, then f*(gh) =f(g*h*) =f(g*)"“f(h*) =f*(g)*f*(h), as was to be 
shown. 

If » is a prime number, then we denote by H(p)=H(G, p) the set of all 
the elements g in G which are mapped upon 1 by all the Cp-characters of G. It 
is a fairly immediate consequence of Lemma I.2.2. that H() is a normal sub- 
group of G, since with f(x) every f(g-'xg) =f’(x) for g in G is a Cp-character 
of G. 

The cross-cut H of all the H(p) is just the set of all the elements g in G 
which are mapped upon 1 by every C-character of G, since every C-character 
of G is a product of Cp-characters of G, as the Cp-characters of G are exactly 
the C-characters of order a power of p. 

The cross-cut H(p) of all the H(g) for g#p will prove important occa- 
sionally; this normal subgroup of G is readily seen to consist of all those ele- 
ments g in G which are mapped upon elements in E(p) by every C-character 
of G. 

The determination of the subgroups H(p) will be one of our most impor- 
tant problems (see III.2 below); in this direction we now prove the following 
statement. 


I.2.3. H(p) for every prime number p. 


Proof. If the element g in G does not belong to Gce,, then it induces in 
E(p) an automorphism which does not leave every element invariant. Thus 
there exists in E(p) an element v such that v¥v’. The C-character f(x) =v'-? is 
a Cp-character and satisfies f(g) #1 so that g does not belong to H(p). 


Coro.uary I.2.4. HSGe. 


This is an immediate consequence of Lemma I.2.3, since Gg is the cross- 
cut of the Gc,, and since H is the cross-cut of the H(). 

I. 3. C-complete groups and the extension of C-characters. The group G 
is termed C-complete, if 1 is the only element in G which is mapped upon 1 
by every C-character of G, that is, if the subgroup H, introduced in I.2, is 
equal to 1. 


THEOREM I.3.1. The group G is C-complete if, and only if, the following three 
conditions are satisfied by G and C: 
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(i) Ge is abelian; 
(ii) the orders of the elements in Gc are divisors of the order m of E; 
(iii) every (Co-)character of Gc in E is induced by a C-character of G. 


Proof. Every C-character of G induces in Ge an ordinary, that is, a Co- 
character in E. If G is C-complete, then 1 is the only element in Ge mapped 
upon 1 by every Co-character of Gc which is induced by C-characters of G. 
The necessity of conditions (i), (ii) is now a consequence of Theorem I.1.3 and 
the necessity of condition (iii) may be deduced from Theorem I.1.4. 

Assume conversely that conditions (i) to (iii) are satisfied by G and C. 
It is a consequence of Corollary 1.2.4 that only elements in Gc are mapped 
upon 1 by every C-character of G. If g¥1 is an element in Ge, then we infer 
from conditions (i), (ii) and Theorem I.1.4 the existence of a Co-character d of 
Ge in E such that d(g) #1; and there exists by (iii) a C-character f of G, in- 
ducing d in Ge, so that in particular f(g) #1. Hence G is C-complete. 


Coro.iary I.3.2. If Gis C-complete, then 
(iv) g-'xg=x* for x in Gc and g in G where the exponent g is the modulo m 
uniquely determined integer corresponding to the element g under C. 


Proof. If x and y are elements in Ge, and if g is an element in G, then 
g and xgy induce the same automorphism in E. Hence C-characters f of 
G satisfy f(g-'xgx—*) “f(g)* = F(a) f(x) f(x-*) 
=f(g-'g)f(x)f(x)-*=1, since f is a Co-character on Gc, and since x and x~? 
are in Gc. Thus g~'xgx~* is mapped upon 1 by every C-character of G; and 
this implies g~'xgx~* = 1, since G is C-complete, as was to be shown. 

RemaARK. If S is a subgroup of Ge, and if the conditions (i), (ii), (iv) are 
satisfied by G and C, then S ts a normal subgroup of G. 


Lemma I.3.3. If the normal subgroup S of G is part of Gc, then the homo- 
morphism C of G induces a homomorphism C of G/S; and every C-character of 
G/S is induced by one and only one C-character of G which maps S upon 1. 


The proof is obvious. 


Lemma I.3.4. If the conditions (i), (ii), (iv) are satisfied by G and C, if S 
is a subgroup of Gc and T a subgroup of G such that G=ST, if the C-characters 
s and t of S and T respectively coincide on the cross-cut S(\T of S and T, then 
there exists one and only one C-character of G which induces s in S and t in T. 


Proof. Every element g in G has the form uv for u in S and v in T since S 
is a normal subgroup of G=ST. If u, u’ are elements in S and », v’ elements 
in T such that uv=wu’v’, then v and v’ induce the same automorphism in E, 
uu’ =vv'—' is an element in S(\T, and s is a Co-character in S; and hence 


we find that 
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$(u)*t(v) = = 
= = s(u’)*’t(r’). 


This shows that f(uv) =s(u)*t(v) for « in S and v in T is a single-valued G 
to E function. 
If u, wu’ are in S and », v’ are in T, then 


f(uvu’r’) = f(uvu'v vv’) = f(unu’ ov’) = 
= 
= [s(m)*t(v) = 
= 


proving that f is the desired C-character of G. 

That f is the only C-character of G meeting our requirements is obvious; 
and likewise it is obvious that the condition of the equality of s and ¢ on 
SC\T is indispensable. 

If the conditions (i), (ii) and (iv) are satisfied by the group G and the 
homomorphism C, then there exists one and essentially only one smallest 
group M which contains Ge and which is a direct product of cyclic groups of 
order m. If x is an element in G, then there corresponds to x under C a 
modulo m uniquely determined integer x; and thus y* is a well determined 
element in M, whenever x is in G and y is in M. There exists therefore one 
and essentially only one group V which contains G and M as subgroups, 
satisfies V= MG and the rule x—'yx = y* for x in G and y in M. This group V 
we shall always denote by MG; and we note that M is a normal subgroup of 
MG whose cross-cut with G is exactly Ge. 


THEOREM I.3.5. If the conditions (i), (ii) of Theorem 1.3.1 and condition (iv) 
of Corollary 1.3.2 are satisfied by the group G and the homomorphism C, then the 
following property of MG is a necessary and sufficient condition for C-complete- 
ness of G: 

(v) There exists a subgroup R of MG such that MG=MR and M(\R=1. 


Remark. The elements of the subgroup R figuring in (v) form a set of 
representatives of the cosets of MG modulo M. Borrowing a term from the 
theory of extensions one may therefore restate (v) as follows: M splits G. 

Proof. Since M(/\G=Gc, there exists a uniquely determined homomor- 
phism of MG into the group of automorphisms of E which coincides with C 
on G and which maps every element in M upon the identity automorphism. 
This extension of the homomorphism C we denote by C too; and we mention 
that (MG)c=M. 

(+) G is C-complete if, and only if, MG is C-complete. 

If MG is C-complete, then G is C-complete, since every C-character of MG 
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induces a C-character in its subgroup G. If conversely G is C-complete, and 
if fo is a Co-character of M in E, then fo induces in Gg a Co-character f;; and 
we infer from Theorem I.3.1 (iii) the existence of a C-character fz of G which 
induces f; in Gc. Since fo and fz coincide on the cross-cut Ge of M and G, 
since M=(MG)c, and since conditions (i), (ii), (iv) are satisfied by MG 
and C, we deduce from Lemma I.3.4 the existence of a C-character f of MG 
which coincides with fz on G and with fo on M. Thus conditions (i), (ii), (iii) 
of Theorem I.3.1 are satisfied by MG and C; and MG is C-complete, if G is 
C-complete. 

(++) MG is C-compleie if, and only if, condition (v) is satisfied by MG. 

If condition (v) is satisfied by MG, then there exists a subgroup R of MG 
such that MG=MR and M/\R=1. Since conditions (i), (ii) and (iv) are 
satisfied by MG and C, and since (MG)c¢= WM, we deduce from Lemma I.3.4 
that every Co-character of M in E is induced by one and only one C-character 
of MG which maps R upon 1. Thus conditions (i), (ii), (iii) are satisfied by 
MG and C; and we have shown that the C-completeness of MG is a conse- 
quence of condition (v). 

For the proof of the converse we need two lemmas which will prove useful 
in later investigations. If g is an element in G and f a C-character of G, then 


we put: 
(*) -F(f) = f(g). 


Lema I.3.5.1. F, is, for every g in G, an ordinary character of the C-char- 
acter group of Gin E. 


This is an immediate consequence of the definition of product of C-char- 
acters. 


Lema I.3.5.2. If Gis C-complete, then g’=g'' is a necessary and sufficient 
condition for Fy, = Fy. 


F, =F, if, and only if, f(g’) =f(g’’) for every C-character f of G. We de- 
duce from Lemma I.2.1 that this is the case if, and only if, f(g’g’’-*) =1 for 
every C-character f of G; and this last fact is equivalent to g’ =g’’, since G is 
C-complete. 

We suppose now that MG be C-complete; and we denote by K the C-char- 
acter group of MG, by L the group of ordinary characters of K in E. It is a 
consequence of Lemma I.3.5.1 that F, for x in MG is an element in L. If 
r, s are elements in M, then F,,.(f) =f(rs) =f(r)*f(s) =f(r) f(s) = F-(f) F.(f) or 
F,,= F,F,; and we infer from Lemma I.3.5.2 that F, is an isomorphism of M 
upon the subgroup Fy of L, since MG is C-complete. Since E is of order m, 
it follows that L™=1. Since M and therefore Fy is a direct product of cyclic 
groups of the maximum order m in L, it follows that Fy is a direct factor of L, 
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proving the existence of a subgroup L’ of L such that L is the direct product 
of Fy and L’. 

We select in every coset X of MG/M a representative r(X), in particular 
r(M)=1. If A and B are cosets of MG/M, then (A, B) =(rAB)-'7(A)r(B) 
is an element in M. Considering that C maps all the elements in the coset X 
upon the same automorphism X of E, we find for every C-character f of MG: 


(4,3)(f) = f(r(AB))f((A, B)) = f(r(A, B)) 4 B)) 
= f(r(AB)(A, B)) = f(r(A)r(B)) = f(r(A))f(r(B)) 
= 


Since F,,x) is an element in the direct product Z of Fy and L’, there exist 
elements c(X) in M and c’(X) in L’ such that F,,x) = F.;x)c’(X). Since (A, B) 
is in M, since the cross-cut of L’ and Fy is 1, we deduce now that c’(AB) 
=c'(A)®c'(B) and that = Fe). Since we have shown before 
that F effects an isomorphism between M and Fy, we may infer from the last 
identity the following equation: c(A B)(A, B) =c(A)c(B) for A, BinGM/M. 

Now we introduce new representatives of the cosets of MG/M by the 
definition: r’(X) =r(X)c(X)— for X in MG/M. Then 


r'(AB) = r(AB)c(AB)— = r(AB)(A, B)c(A)-8c(B) 
= = 
= 1'(A)r'(B); 


and this shows that the representatives r’(X) of MG/M form a subgroup of 
MG, that is (v) is satisfied by MG. This completes the proof of (++); and 
our theorem is an immediate consequence of (+) and (++). 

The criterion (v) for C-completeness has the disadvantage of not being an 
“interior” property of G, since it involves extensions of the group G. This 
makes this criterion hard to handle in applications and for this reason we 
shall attack the problem from a different angle in Chapter III, in particular 
Theorem III.3.3. 

The last paragraph of the proof of Theorem I.3.5 as well as the criterion 
(v) contain a comparatively simple method for actual construction of C-com- 
plete groups. This may be indicated briefly: If G is a C-complete group, then 
Gc is an abelian group which we shall denote by B, and B is part of the group 
M which is a direct product of cyclic groups of order m. The quotient group 
G/Ge is a group A of automorphisms of the cyclic group E of order m and is 
at the same time a group of automorphisms of M, since we may identify every 
element x in A with an integer modulo m that is prime to m. To determine 
G completely as an extension of B by A, realizing the given automorphisms 
of B, we need only the “factor sets” (a, 6) for a and 6 in A which had been 
discussed in the last paragraph of the proof of Theorem I.3.5, and criterion 
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(v) states—as has been pointed out during this proof—exactly that there 
exists a single-valued A to M function c(x), satisfying: 


ee c(x)"c(y) = c(xy)(x, y) for x and y in A. 


Putting h(x) = Bc(x) for x in A we obtain a single-valued A to M/B function 
satisfying h(x)h(y) =h(xy) for x, y in A, that is a crossed homomorphism 
of A into M/B. 

If c’(x) is another solution of the functional equation (**), then the func- 
tion satisfies and is therefore a 
crossed homomorphism of A into M. Passing from the factor set (a, 5) to an 
“associated” factor set (a, b)’=b(y)—'b(x)—"b(xy)(x, y) for b(z) in B clearly 
does not change the crossed homomorphisms of A into M/B at all. 

That conversely every crossed homomorphism of A into M/B may be 
realized in the fashion just described is readily seen. 


CHAPTER II. THE C-CHARACTER GROUP 


It is the object of this chapter to give a somewhat crude description of the 
group of all the C-characters of a group G. This discussion, however, will be 
sufficient for determining the order of the C-character group of G, in case G 
is C-complete. 

II. 1. The automorphisms of a cyclic group. For the convenience of the 
reader we collect here a number of known facts concerning the automor- 
phisms of finite cyclic groups(?*). 

If E is a cyclic group of order m0, then the group of all the automor- 
phisms of E is a finite abelian group. If g is an automorphism of EZ, then g 
determines and is determined by a modulo m uniquely determined number g. 
Automorphism and number g are related to each other by the fact that the 
automorphism g of E maps every element in E upon its gth power. A number 
belongs to an automorphism if, and only if, it is prime to m. This relation 
between automorphisms and numbers is an isomorphism between the group 
of automorphisms of E and the multiplicative group of integers prime to m 
modulo m. 

It is sufficient to discuss the case of a primary group E. Then the order 
of E is a prime power m=p*. 

Case A. p is odd. Then the group of all the automorphisms of E is a cyclic 
group of order p*—"(p—1). 

Each of the following properties of an automorphism g of E and of the 
corresponding integer g modulo m implies all the others: 

(1) The automorphism g is not of order a power of p. 

(2) 1 is the only element in E which is left invariant by g. 

(3) The integer g is not congruent to 1 modulo p. 


(#°) See, for example, Hecke, op. cit. §13. 
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It will be convenient to term the automorphisms of order a power of p 
regular automorphisms of E and those meeting the requirements (1) to (3) 
irregular automorphisms of E. 

If g is an irregular automorphism of E, then g—1 is prime to p and the 
order nm of the automorphism g is the least positive integer k such that 
(g*—1)/(g—1) =0 modulo m. 

The (regular) automorphism g is of order p*~‘ for 0<j <u if, and only if, 
the integer g is congruent to 1+%g’ modulo m for g’ an integer prime to p. 
Then —1)/(g—1) is divisible by p*-/, but not by 

Case B. p =2. If E is of orders 1 or 2, then the identity is the only automor- 
phism of E. Thus we assume that 1<y. Then E contains one and only one 
element of order 2 and this is left invariant by every automorphism of E. 

The automorphism of E which maps every element in E£ upon its inverse 
shall be termed the inversion of E and shall be dencted by z. The group of all 
the automorphisms of E is the direct product of a cyclic group of order 2#-? 
and of the group of order 2 which is generated by the inversion z. 

The following two properties of an automorphism g of E and of the corre- 
sponding integer g modulo m are equivalent: 

(i) The elements of orders 1 and 2 are the only elements in E which are 
left invariant by the automorphism g. 

(ii) g=—1 modulo 4. 

We shall term the automorphisms of E meeting the requirements (i), (ii) 
irregular automorphisms of E, the others regular automorphisms of E. We note 
that the inversion is an irregular automorphism. 

If the automorphism g of E is different both from the identity and the in- 
version, then the corresponding integer gis of the form g= +1+2/g’ modulo m 
where g’ is odd and 1<j<y; and the order of the automorphism g is 2*~'. 

If g is a regular automorphism of order 2/#1, then (g? —1)/(g—1) is di- 
visible by 2‘, though not by 2/*'. If g is an irregular automorphism of order 2’, 
not the inversion, then (g? —1)/ (g—1) is divisible by 2*~', but not by 2+. 

We extend the definition of a regular automorphism of E, in case E is of 
orders 1-or 2, by saying that the identity automorphism is always regular. 

The classification of automorphisms of E leads naturally to a classification 
of the homomorphisms C. 

Suppose that E is a cyclic group of order m=][,p™”, E(p) the subgroup 
of prime power order p™”), that G is a finite group and that C is a homomor- 
phism of G into the group of automorphisms of E. If every element in G-. in- 
duces (under C) in every E(p) a regular automorphism, then C is termed 
regular, otherwise irregular. If in particular 1<m(2) and an element in G in- 
duces the inversion in E(2), then C is said to be singular (so that a singular C 
is always irregular). We note that every G/Gc,is of order a power of », if C is 
regular, though this condition is not sufficient for regularity, since G/Ge: is 
always of order a power of 2. 


} 
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II. 2. The C-characters of automorphism groups of primary cyclic groups. 
In this section we are concerned with a complete determination of the C-char- 
acters of a group G which is mapped isomorphically by C upon a group of 
automorphisms of the cyclic group E of prime power order. 


THEOREM II.2.1. Suppose that Cis a non-singular isomorphism of the group 
G into the group of automorphisms of the cyclic group E of prime power order 
m= p*, 

(a) Every C-character of G belongs to the principal genus so that the C-charac- 
ter group of Gis a cyclic group. 

(b) If C is regular, then G and its C-character group are of equal order. 

(c’) If C is irregular and p is odd, then the C-character group of G is of 
order m= p*. 

(c’’) If C is irregular and p=2, then the C-character group of G is of order 


Proof. We may assume throughout that G¥1. 

Case A(""). p is odd. Then the group of all the automorphisms of E is a 
cyclic group. Hence G is a cyclic group, generated by some element g. 

Case A’. C is regular. Then the integer g corresponding to the automor- 
phism g of E is of the form g=1+ %g’ modulo m where g’ is prime to p and 
0 <j <y; and the order of g (and of G) is p*-i=n. 

If f is a C-character of G, then 1=f(g")=f(g)* where the exponent 
s=1i+g+ --- is divisible by p*-i, but by no 
higher power of p. Consequently f(g) is an element in E*® =E'-*. There 
exists therefore an element v in E such that f(g) =v'~¢ and it is readily verified 
that f(g*) =v'-*. Thus every C-character of G belongs to the principal genus; 
and G and its C-character group are of the same order p*~’, since the unit char- 
acter e'~* for e a generator of E is of order p*~’. 

Case A’’. C is irregular. Then neither G nor g is of order a power of p. 
Hence g—1 is prime to p so that E=E*~*. There exists therefore to every 
C-character f of G an element v in E such that f(g) =v'~*; and it is readily seen 
that f(g‘) =v'-#*. Thus every C-character of G belongs to the principal genus; 
and the order of the C-character group is m, since e'~* for e a generator of E 
is of order m. 

Case B. p=2. G contains just one element of order 2, since G does not 
contain the inversion of E. Hence G is cyclic (and 2<y). Denote by g some 
element generating G. 

Case B’. C is regular. Then g=1+2%g’ modulo m where g’ is odd and 
1 <j <y; and an exact repetition of the argument used in Case A’ shows that 
every C-character of G belongs to the principal genus, and that G and its 
C-character group are of the same order 2*~', 


(#1) These cases refer to the analogous cases in II.1. 
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Case B’’. C is irregular. This occurs if, and only if, g=—1 modulo 4, 
that is if, and only if, g is of the form —1-+2%g’ where g’ is odd and 1<j<u, 
since g is not the inversion of E. Then 1—g=2—2‘g’=2g’’ modulo m where 
g’’ is odd. Hence E'~*= E* showing that the order of the principal genus is 
2-1, The order of the automorphism g and of the group G is 2*-i=n. Thus 
every C-character f of G satisfies 1=f(g")=f(g)* where the exponent 
s=1+g+ --- is divisible by 2*—', but not by 2+. 
From this fact we deduce that f(g) is an element in E?=E!~°; and it follows 
as usual that f belongs to the principal genus, completing the proof. 


THEOREM II.2.2. Suppose that C is a singular isomorphism of the group G 
into the group of automorphisms of the cyclic group E of order m=2+. 

(a) The principal genus is of order 2"! and the C-character group of G is 
of order 2". 

(b’) If G is cyclic (of order 2), then the C-character group of G is cyclic. 

(b’’) If G is not cyclic, then the C-character group of G is the direct product 
of the principal genus and of the cyclic group of order 2 which consists of all the 
C-characters of G mapping the inversion z upon 1. 


Proof. We distinguish two cases. 

Case 1. G is cyclic. Then G consists of the elements z and z?=1, since z 
generates a direct factor of the group of all the automorphisms of E. If v is 
any element in E, then f(z) =v defines a C-character of G because of f(z)*f(z) 
= 1. Hence the order of the C-character group of G is m. If u is the unit char- 
acter e'-*, then u(z) =e!-*=e? so that the principal genus is of order 24-'. 

Case 2. G is not cyclic. Then there exists a basis of the abelian group G 
which contains z, and which therefore has the form z, g where g is an element 
of order n= 2*-i with 1<j <u. 

If f is a C-character of G, then f(g)—*f(2) =f(g)*f(2) =f(ez) =f(2g) =f(2) f(g) 
or 


(+) = f@)**. 


Put v=f(z). Then v'~*=v?=f(z)? and and f(x)? is 
a consequence of the fact that z and g generate G. Thus we have shown that 
the square of every C-character belongs to the principal genus. 

It is a consequence of Theorem II.2.1 that every C-character of the cyclic 
group generated by g belongs to the principal genus. Hence there exists to 
every C-character f of G an element v in E such that f(g) =v'~*; and we de- 
duce from (+) that v?-9 =f(z)!-¢, Since E is cyclic and g an automorphism, 
not the identity, this implies that f(z) is in E*. If wis an element in EZ such that 
f(z) =w?, then the C-character f’(x) =f(x)w*—' has the property that f’(z) =1; 
and we have shown that every C-character f of G is modulo the principal 
genus congruent to a C-character of G which maps z upon 1. 

Suppose now that r is a C-character of G such that r(z) =1. Then it follows 


| 
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from (+) that r(g)?=1. If r(g)=1, then r=1. Since the element of order 2 
in E is left invariant by every automorphism of E, there exists one and only 
one C-character of G (which is at the same time a Cy-character of G) which 
maps z upon 1 and g upon the element of order 2. Hence the principal genus 
is of index 2 in the C-character group of G. If the C-character r of G satisfies 
r(z) =1r(g), then r cannot be in the principal genus, since otherwise there 
existed an element s in E, satisfying s?=s!~*=r(z) =1 and s'-*=r(g) 1, an 
impossibility since g is odd and 1—g is even. Thus we have shown that the 
C-character group is the direct product of the principal genus and of a cyclic 
group of order 2. The order of the principal genus is 2*—!, since E?= E!-*; and 
this completes the proof. 

The following statement is an immediate consequence of the two pre- 
ceding theorems. 


Coro.iary II.2.3. Suppose that C is an isomorphism of the group G into 
the group of automorphisms of the cyclic group E of prime power order m= p*. 

(a) The C-character group of G is cyclic if, and only if, G is cyclic. 

(b) The C-character group of G is equal to the principal genus if, and only if, 
C is not singular. 


We conclude this section with a proof of the following fact. 


Coro.iary II.2.4. Suppose that C is an isomorphism of the group G into 
the group of automorphisms of the cyclic group E of prime power order m= p*. 
Then the C-character f of G belongs to the principal genus if, and only if, one of . 
the following conditions is satisfied: 

(1) 4 does not divide m; 

(2) 4 is a divisor of m, but none of the elements in G induces in E the inver- 
sion 2; 
(3) 44s a divisor of m, the group G contains an element 2 which induces in E 
the inversion z and f(z) is in E?, f(g) =f(z)“-®2”" for every g in G. 


Proof. It is an immediate consequence of Corollary II.2.3(b) that every 
C-character of G is a principal character, if condition (1) or (2) is satisfied. 
Thus we assume now that neither (1) nor (2) be satisfied by G and C. If (3) 
is satisfied by the C-character f, then there exists an element v in E such 
that f(z) =v?; and f(g) = f(z) 0-02 =yl-o for every g in G is a principal char- 
acter. If conversely f is a principal character, then there exists an element e 
in E such that f(x) =e'-* for every x in G. In particular f(z) =e!-* =e? is in E? 
and f(g) =e!-* =f(z)"-92"", since g is odd and 1—g is even. 

II.3.. The principal genus. It is obvious that C-characters in the principal 
genus map Gc upon 1. The problem to be discussed in this section is therefore 
that of characterizing the principal genus among the C-characters mapping 
Ge upon 1. We use the following notations: 

(I1.3.*) E is a cyclic group of order m=|[,p™”, E(p) the cyclic subgroup 
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of prime power order p™”) of E; G is a group and C a homomorphism of G 
into the group of automorphisms of E. 


THEOREM II.3.1. The C-character f of G belongs to the principal genus if, 
and only if, the following conditions are satisfied by f: 

(1) af the automorphism induced by the element g in G leaves every element 
in E(p) invariant, then the order of f(g) is prime to p; 

(2) af 4 is a divisor of m, and if the element 2 in G induces the inversion in 
E(2), then f(z) belongs to E? and f(g)"2~"° =f(z)0-9m2-" for every gin G. 


Proof. If f is any C-character of G, then f»"-”” =f, is a Cp-character of G. 
If f is a principal character, then there exists an element v, in E(p) such that 
f(x) =v}~* is a principal Cp-character of G. Consequently f,(x) =1 for x in 
Ge», or equivalently, the order of f(x) is prime to p for x in Gey. But x is 
in Gc, if, aad only if, the automorphism induced by x leaves every element in 
E(p) invariant, showing the necessity of (1). The necessity of (2) is an im- 
mediate consequence of Corollary II.2.4 (3). If conversely conditions (1), (2) 
are satisfied by f, then we infer from (1) that f, maps Gc, upon 1; and hence 
it follows from (2) and Corollary I1.2.4 (3) that f, is a principal Cp-character 
of G. Thus there exists, for every prime p, an element v, in E(p) such that 
f(x) =v)” for every x in G. Since mp-”) is prime to p, there exists an integer 
4(p) such that mp-™”4(p) =1 modulo p™”). Put v=|[ vi. Then it is readily 
seen that f(x) =v'-*for every x in G, showing the sufficiency of the conditions 
(1), (2). 

In the enunciation of the next theorem we make use of the fundamental 
subgroups H(p), introduced in I.2. We note that H(p) is exactly the set of all 
the elements in G which are mapped upon 1 by every Cp-character of G; and 
that H(p) S$Gc, by Lemma I.2.3. 


THEOREM II.3.2(!*). The principal genus consists of all the C-characters of G 
which map Gc upon 1 if, and only tf, 

(1) H(p)Gc=Gey for every p, and 

(2) Cts not singular. 


REMARK. If E is of prime power order, and if G is C-complete, then condi- 
tion (1) is automatically satisfied. 

Proof. It has been pointed out before that H(p) S H(p) Ge SGep; and it 
follows from the very definition of H(p) that G/H(p) is a Cp-complete group. 
Thus it follows from Theorem I.3.1 that Gc,/H(p) is an abelian group the 
orders of whose elements are divisors of p™”), and that every Co-character of 
Gc,/H(p) is induced by a Cp-character of G/H(p). We note furthermore the 
obvious fact that every Cp-character of G/H(p) is induced by one and only 
one Cp-character of G, since Cp-characters of G map H(p) upon 1. 


(2) Corollary I11.2.3 below improves up6n this theorem. 


} 
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If H(p)Gce<Gep, then it follows from Theorem I.1.4 that there exists a 
Co-character of Gc,/H(p) which maps H(p)Ge upon 1, though it does not 
map every element in Gc, upon 1; and consequently there would exist a 
Cp-character of G which does not map Gc, upon 1, though it maps H(p)Ge 
upon 1. It is obvious that such a C-character maps Ge upon 1 without belong- 
ing to the principal genus, proving the necessity of (1). 

If 1<m/(2), and if there exists in G an element z inducing the inversion 
in E(2), then we infer from Corollary I1.2.3 (b) the existence of a C2-character 
of G which maps Gce2 upon 1 and which does not belong to the principal genus, 
proving the necessity of (2). 

Suppose conversely that conditions (1), (2) are satisfied by G; and that 
the C-character f of G maps Ge upon 1. Clearly f is the product of uniquely 
determined Cp-characters f,, since a C-character is a Cp-character if, and only 
if, its order is a power of p. Since f, is a suitable power of f, it follows that f, 
maps Ge upon 1. Since H(p) is mapped upon 1 by every Cp-character of G, 
we infer from (1) that Gc, is mapped upon 1 by f». Now it follows from (2) 
and Corollary I1.2.3(b) that every f, belongs to the principal genus, showing 
that the product f of the f, belongs to the principal genus too. The conditions 
(1), (2) are therefore sufficient. 


Coro iary II.3.3. The principal genus consists of all the C-characters of G 
which map Gc upon 1, if the following two conditions are satisfied: 


(1) af the automorphism g corresponding to the element g in G leaves the ele- 
ments in E(p) invariant, then its order is prime to p; 
(2) Cis not singular. 


Proof. Suppose that g is an element in Gc,. Then it follows from (1) that 
its order modulo G¢ is prime to p. Consequently g = g’g’’ where the order of g’ 
is prime to p, the order of g’’ is a power of p, and where g’’ belongs to Ge. 
Since g’ is an element in Gc, whose order is prime to p, and since Cp-charac- 
ters of G induce Co-characters in Gc», it follows that g’ belongs to H(p); and g 
belongs therefore to H(p)Gc, proving that Gc, =H(p)Gce. Our contention is 
now an immediate consequence of Theorem II.3.2. 

II.4. Imbedding of C-complete groups into direct products. It will be well 
to remember that C-complete groups may be characterized in two equivalent 
ways: a group G is C-complete if 1 is the only element mapped upon 1 by every 
C-character of G; and this is the case if, and only if, 1 is the cross-cut of the 
subgroups H(p). 

The following concepts will prove convenient for the enunciation of our 
imbedding theorem: suppose that E(p) is a cyclic group of order p™») and 
that C, is a homomorphism of the group G, into the group of automorphisms 
of E(p), where the primes p range over all the prime divisors of some in- 
teger m. We form the direct product E of the E(p), the direct product G* of 
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the groups G, and we define the homomorphism C* of G* into the group of 
automorphisms of E as follows: if [],g, for g, in G, is an element in G*, then 
the automorphism of £ corresponding under C* to this element induces in 
E(p) the same automorphism as the automorphism corresponding under C, 
to g,. This homomorphism C* will always be referred to as the direct product of 
the homomorphisms C,. 


THEOREM II.4.1. If C is a homomorphism of the group G into the group of 
automorphisms of E, and if G is C-complete, then G may be imbedded into the 
direct product of the quotient groups G/H(p) in such a way that the direct product 
of the homomorphism Cp induces C in G. 


Proof. The homomorphism Cp of G into the group of automorphisms of 
E(p) is well defined on G, =G/H(p), since H(p) S$Gc, by Lemma I.2.3. We 
denote by G* the direct product of the groups G, and by C* the homomor- 
phism of G* into the group of automorphisms of E which we defined as the 
direct product of the homomorphisms Cp. 

If g is an element in G, then put g* =] ],H(p)g an element in G*; and it is 
readily seen that a homomorphism of G upon a subgroup of G* is defined by 
mapping g upon g*. If g*=1, then g is contained in every H(p) and g=1isa 
consequence of the C-completeness of G. Thus G has been mapped isomorphi- 
cally upon a subgroup of G*. The automorphism of E corresponding under C* 
to g* induces in E(p) the same automorphism which corresponds to g un- 


der Cp, that is the automorphism of E corresponding to g* under C* is the 
same as the automorphism of E which corresponds to g under C; and this 
completes the proof. 

If we denote by h(p) the index of H(p) in G, that is the order of G/H(p), 
then the following statement is an immediate consequence of Theorem II.4.1. 


Coro.iary I1.4.2. If G is C-complete, then the l.c.m. of the h(p) is a divisor 
. of the order of G; and the order of G is a divisor of the product of the h(p). 


We have introduced the cross-cut H(p) of all the H(q) for g#p; this sub- 
group H(p) consists exactly of those elements in G which are mapped upon 
elements in E(p) by every C-character of G. If Gis C-complete, then the cross- 
cut of H(p) and H(p) is 1 so that a coset of G/H(p) contains at most one ele- 
ment in H(p). 


THEOREM I1:4.3. If G is C-complete, then each of the following properties 
implies all the others: 

(1) G is (essentially) the direct product of the G/H(p). 

(2) The order of Gis |] ,h(p). 

(3) Every coset of G/H(p) contains an element in H(p). 

(4) G ts the direct product of the H(p); and H(p) is the direct product of the 


H(q) for 9p. 


= 
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Proof. The equivalence of (1) and (2) is an obvious inference from Corol- 
lary I1.4.2 (and Theorem II.4.1). To prove the equivalence of (1) and (3) we 
consider the map of G into the direct product G* of the quotient groups 
G/H(p) which we used in the proof of Theorem II.4.1. Clearly G is the direct 
product of the G/H(p) if, and only if, G is mapped by this transformation 
upon the whole group G*; and this is the case if, and only if, the following 
condition is satisfied: 

(3’) If C(p) is, for every prime ~, a coset of G/H(p), then the cross-cut 
of the sets C(p) is not empty. 

Suppose now that (3’) is satisfied; and that, for some given p, U(p) is a 
coset of G/H(p). Put U(q) =H(q) for g#p. Then there exists by (3’) an ele- 
ment « contained in all these U(r). Clearly u is in H(p), proving that (3) 
is a consequence of (3’) and therefore of (1). 

Suppose conversely that (3) is satisfied, and that C(p) is for every prime p 
(dividing m) a coset of G/H(p). There exists by (3) to every » an element 
c(p) in the cross-cut of C(p) and H(p). Put c=|],c(p) (the order of the factors 
does not matter). If p is some prime, then c=c(p) modulo H(p), since c(qg) 
for g~p belongs to H(q) SH(p); and this shows that c is an element in the 
cross-cut of the C(p). Thus (3’) is a consequence of (3); and (3) implies (1), 
since (3’) implies (1). 

If G is the direct product of the H(p), then it is clear that H(p) is the 
direct product of the H(q) for g# >, and that G is, for every p, the direct prod- 
uct of H(p) and H(p). But this last fact shows that (3) is a consequence of (4). 
If conversely (3) is satisfied by G, then G is the direct product of H(p) and 
H(p), since in C-complete groups the cross-cut of H(p) and H(p) is 1. Thus 
H(p) is of order h(p) and the product of the subgroups H(p) is their direct 
product and is of order I],4(?), showing that G is, by Corollary 11.4.2, the 
direct product of the H(p). Hence (3) and (4) are equivalent. 


Coro.iary II.4.4. If G is C-complete, and if h(p) and h(q) are relatively, 
prime for pq, then G is the direct product of the G/H(p). 


REMARK. This criterion is satisfied, for example, whenever h(p) is, for 
every p, a power of p. 

Proof. If h(p) and h(q) are relatively prime for pg, then the I.c.m. of the 
h(p) is equal to their product. Hence we deduce from Corollary I1.4.2 that 
[],4(?) is the order of G. Consequently we may infer from Theorem II.4.3 
that G is the direct product of the G/H(p). 

II.5. The order of the C-character group. Noting that H(p)<Gc, by 
Lemma I.2.3, we may decompose the index A(p) into the product h’(p)h’’(p) 
of two indices where 

h'(p) =index of H(p) in Gc; and 

h''(p) =index of Gc, in G. 


} 
q 
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Furthermore we define an invariant h'''(p) as follows: 

h'''(p) =h''(p), tf Cp ts regular. 

h'''(p) =p™, if p is odd and Cp is irregular. 

h'''(2) =2™®), if C2 is singular. 

h’''(2) =2™)-1, if C2 is neither regular nor singular. 
The motivation for this definition of h’’’(p) will be found in Theorems II.2.1 
and IT.2.2. 


Lemma II.5.1. h'(p) is a power of p, and h''(p) 


Proof. Considering that, by the definition of H(p), the group G/ H(p) is 
Cp-complete, it follows from Theorem I.3.1 that h’(p) is a power of p. Since 
G/Gc, is essentially a group of automorphisms of the cyclic group E(p) of 
order p”), it follows that h’’(p) is a divisor of p™‘»)-"(p—1), which makes 
h''(p) Sh’’'(p) evident. 


THEOREM II.5.2(#*). The order of the C-character group of Gis] ph’ (p)h'’'(p). 


Remakk. If in particular E is of order a power of p, and if G is C-complete, 
then the order of the C-character group of G is h’’’(p) times the order of Ge. 

Proof. Denote by K(p) the group of those Cp-characters of G which map 
Gc, upon 1. Then the order of K(p) is, by Theorems II.2.1 and II.2.2, ex- 
actly h’’’(p), since every Cp-character of G/Gc, is induced by one and only 
orte Cp-character of G which maps Gc, upon 1. It is a consequence of Theo- 
rem I.1.3 that the group of characters of Gc,/H(p) in E(p) is exactly of order 
h'(p). From Theorem I.3.1 we deduce that every character of Gc,/H(p) in 
E(p) is induced by one, and ‘therefore by h’’’(p), Cp-character of G, since 
G/H(p) is Cp-complete. The group of Cp-characters of G is therefore of order 
h'(p)h'’'(p). But the C-character group of G is the direct product of the Cp- 
character groups of G, showing that its order is [] ,h’(p)h’’'(p). 


I1.5.3. Suppose that G is C-complete. 
(a) The order of G does not exceed ihe order of the C-character group of G. 
(b) G and its C-character group are of equal order if, and only if, C is regular. 


Remark. If C is regular, then every h’’(p) is a power of p. Hence h(p) is a 
power of ~ too, from Lemma II.5.1. Consequently we may deduce from 
Corollary I1.4.4 that G is the direct product of the quotient groups G/H(p). 

Proof. It is a consequence of Corollary 11.4.2 and of the C-completeness 
of G that the order of G is a divisor of [],4(p). We infer from Lemma II.5.1 
that h(p) Sh’(p)h’’’(p). Hence (a) is a consequence of Theorem II.5.2. 


(3) It may be deduced from Theorem III.2.2 (a) and Corollary III.3.2 below that [],h’(p) 
equals the order of Gc multiplied by []h’’"(p) where h’’”’(p) is the index of GcH(p) in Gcp. 
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If G and its C-character group are of equal order, then we deduce from 
(a) and Lemma II.5.1 that h’’(p) =h’’’(p). This equality implies the regu- 
larity of C, as follows immediately from the definition of h’’’(p). If conversely 
C is regular, then we infer from Corollary I1.4.4 and Theorem II.4.3 that 
I] ,4(p) is the order of G; and from the definition of h’’’(p) we infer that 
h''(p) =h'''(p). Hence G and its C-character group are by Theorem II.5.2 of 
equal order, if C is regular. 


CHAPTER III. CHARACTERIZATION OF C-COMPLETE GROUPS 


III.1. The case of primary E. The following lemma will prove helpful in 
our discussion. 


Lema III.1.1. If G is a cyclic group of order n, generated by the element g, 
if E is a cyclic group of order m, if C is a homomorphism of G into the group of 
automorphisms of E, and if vis an element in E, then neces- 
sary and sufficient condition for the existence of a C-character f of G such that 
f(g) =. 

Proof. The necessity of the condition is an immediate consequence of 
1=f(1) =f(g") =f(g)*+#+-: +9". If the condition is satisfied, then a single val- 
ued G to E function f is defined by f(g‘) =v'+**:+e** for 0<iSn; and this 
function satisfies v=f(g), 1=f(g")=f(1). If 0<7, 7m, then there exists a 
uniquely determined integer k=0, 1 such that 0<i+j—kn <n; and we find 
that f(gig’) =f(giti-*") = since, in case k=1, we 
may multiply by 1 and we find that 


as was to be shown. 


THEOREM III.1.2. If E is a cyclic group of prime power order m=p*, and 
if C is a homomorphism of the group G into the group of automorphisms of E 
then the following four conditions are necessary and sufficient for C-complete- 
ness of G: 

(i) Ge is abelian; 

(ii) the orders of the elements in Gc are divisors of m; 

(iii) g-'xg =x? for g in G and x in Ge; 

(iv) if p=2, and if the inversion of E is induced by the element z in G, 
then 2? =1. 


Proof. If G is C-complete, then we may infer conditions (i), (ii) from 
Theorem I.3.1, condition (iii) from Corollary 1.3.2; and the necessity of (iv) 
may be seen as follows: if the element z in G maps every element in E upon its 
inverse, and if f is a C-character of G, then f(z?) =f(z)*f(z) =f(z)—'f(z) =1; and 
the C-completeness of G implies z2?=1. 


= 
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We suppose now that the conditions (i) to (iv) are satisfied by G, C. 

* Case 1. G/Ge is cyclic(**). Then there exists an element g which generates 
G modulo Gc. We denote by ’ the order of G/Ge which is at the same time 
the order of g modulo Gc; and we put g* =g”’. If {y} denotes the cyclic group, 
generated by the element y, then {g}c¢={g*}. 

The proof of the following statement will be the most important step. 

(+) There exists a C-character f of {g} such that g* and f(g*) are of equal 
order. 

We distinguish several cases. 

Case 1. n’ is not a power'of p. Then p is odd, g—1 is prime to 9p; and it 
follows from (ii) that x*=x for x in Gc implies x=1. Hence it follows from 
(iii) that g* = g—'g*g = g*? and that therefore g* = 1. (+) is trivially satisfied. 

Case 2. p=2 and g induces the inversion in E. Then it follows from (iv) 
that g?=1. Hence either g*=g*=1 and (+) is obvious; or else g=g* is an 
element in Gc; and the existence of the desired C-character which is a Co-char- 
acter is a consequence of (ii). 

Case 3. n’ is a power of p; and in case p=2 the integer g corresponding to 
the element g under C is congruent to 1 modulo 4. In this case the integer g 
is of the form g=1+p%g’ modulo m where g’ is prime to p, 0<j Su, and where 


imply 1 


Then we infer from the remarks in II.1 that m’=p*~/ and that 1+g+ -- - 
+g”’— is divisible by p*~’, but not by p*-‘+!. We note that m’ is a divisor of 
the order of g so that n=n'n'’ where both n’ and n’”’ are powers of p, 
since ’’ is the order of the element g* in Gc and since we may therefore 
apply (ii) on g*. Since m’ is the order of the automorphism g of E, it fol- 
lows that g*’=1 modulo m; and thus it follows that »=n'n"’ is a divisor of 
+g”) =1+g+ - - - modulo m. Finally we infer from 
condition (iii) that g*=g-'g*g=g** or g=1+p’g’=1 modulo n”’, since n’’ 
is by (ii) a divisor of m; and this shows that ” is a divisor of p’, since n’’ is a 
power of » whereas g’ is prime to p. Hence n=n’n'’=p*-in"’ is a divisor of 
pi=m. 

From the last remark we infer the existence of an element v of order n 
in E. Since it has been shown that » is a divisor of 1+g+ - - - +g*—'it follows 
that v'+e+---+e"*=1; and we infer from Lemma III.1.1 the existence of a 
C-character f of {g} such that f(g)=v. Then f(g*) =f(g*’) is 
an element of order n/n’=n’'’, since 1+g+ --- is divisible by 
n'=p*-i, but not by n’p. Since g* is of order n’’, the C-character f of {g} 
meets all the requirements of (+). 

Case 4. p=2, 1<y, g=—1 modulo 4, g4 —1 modulo m. In this case the 
integer g is of the form g= —1+2%g’ where g’ is odd and 1<j<y. Then we 


(44) A proof of our theorem in Case I may be constructed which makes use of Theorem 
1.3.5, 
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infer from the remarks in II.1 that m’=2*-/ and that 1+g+ - - - +g*’—' is 
divisible by but not by 

It is a consequence of (ii) and (iii) that g*=g~'g*g=g** or 2—2ig’ is di- 
visible by the order n’’ of g*. Since n”’ is a power of 2, it follows that m’’=1 
or 2, that is that g*=1 or g* is of order 2. 

If g*=1, then (+) is obvious. If g* is of order 2, then n=2n’, 
- - - +g*’—') modulo m, since =1 modulo m. 
If v is any element of order m in E, then 1=v!+e+---+0"", since 1+g+ - - - 
+g"—' is divisible by 2 2*-'=m; and it follows from Lemma III.1.1 that 
there exists a C-character f of {g} such that f(g)=v. Then f(g*) =f(g*’) 
=yite+---+e"~ is of order 2, since the exponent is divisible by 2*—', but 
not by 2; and this shows that f meets all the requirements of (+). 

Thus we have completed the proof of (+). 

Suppose now that fo is a Co-character of Gc in E. Then it follows from (+) 
that there exists a C-character f; of {g} which coincides with fy on the cross- 
cut {g*} of Gc and {g}. Hence we may infer from Lemma I.3.4 the existence 
of a C-character f of G which induces fo in Ge and f; in {gt}, since G=Ge{g}, 
and since the conditions (i) to (iii) are satisfied by G and C. This shows that 
every Co-character of Gc in E is induced by a C-character of G. But this 
implies C-completeness of G, as follows from Theorem I.3.1, since (i), (ii) are 
satisfied by G and C. 

Case Il. G/Ge is not cyclic. In this case p= 2; and G/Gc is the direct prod- 
uct of a cyclic group V by a cyclic group Z of order 2 such that the element of 
order 2 in Z maps every element in E upon its inverse (note that 2<y). We 
denote by W the uniquely determined group between Ge and G which satisfies 
W/Gc=V; and we infer from what has been shown in Case I that W is 
C-complete. 

If z is any element, representing the coset of order 2 in Z, then it follows 
from gs (iv) that z is of order 2. We note finally the obvious fact 
G=W}3}. 

The proof of the following statement will be the most important step. 

(++) Every C-character of W is induced by a C-character of G. 

Mapping every element g in W upon the element zgz effects an automor- 
phism of W; and g and zgz clearly induce the same automorphism of EZ. Hence 
it follows from Lemma fI.2.2 that f:(x) =f(zxz) is a C-character of W whenever 
fis a C-character of W. 

Suppose now that f is any C-character of W. Then it follows from the iast 
remark that f2(x) =f(x)fi(x) =f(x)f(zxz) is a C-character of W. If y is any 
element in Ge, then it follows from (iii) that zyz=y~!; and since C-characters 
of W induce Co-characters of Gc, we find that f2(y)=1 for y in Go= We. 
Since none of the automorphisms of E which are induced by elements in W 
maps every element in Z upon its inverse, we may apply Theorem II.2.1 upon 
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fe. Hence fe belongs to the principal genus of W, that is there exists an ele- 
ment v in E such that v'~*=f2(x) =f(x)f(2xz) for x in W. 

Since W is a normal subgroup of G, and since z is an element of order 2, 
generating G modulo W, it is possible to represent every element in G in one 
and only one way in the form g=w(g)z* for w(g) in W and 0 Si(g) $1. Con- 
sequently a single-valued G to E function is defined by 

= 
where v is the element in E introduced before. 

The element g is in W if, and only if, i(g) =0 and g=w(g), proving that f 
and f’ coincide on W. If g and y are elements in G, then gy = w(g)2* w(y)2*™ 
Thus w(gy) =w(g)z*w(y)z* and i(gy) =4(g) 
+i(y) modulo 2, since W is a normal subgroup of G. From the choice of the 
element v it follows that f(z‘xz*) =v'-="f(x)-* for x in W and i=0, 1. This 
implies that 


f[w(gy)] = f[w(g) w(y)2* 
= f[w(g) f[w(y) 


One verifies furthermore by substituting the possible values for i(g) and i(y) 
that 


[1 — w(y)* ](— 1) + i(gy) = + w(y)(— 1)*™. 


Consequently we have finally 
f'(gy) f[w(gy) view 


= f[w(g) [w( y) ] (—1)*) 9 i(y)+ w(y) (—1)*) 
= { f[w(g)] Jem F[w(y) yim 
= f’(g)*f'(y). 

Thus f’ has been shown to be a C-character of G which induces f in W; 
and this completes the proof of (++). 

Since W is C-complete and contains Gc= We, it follows from Theorem 
1.3.1 that every Co-character of Gc in E is induced by a C-character of W; 
and hence we infer from (++) that every Co-character of Gc in E is induced 
by a C-character of G. The C-completeness of G is now an immediate infer- 
ence of Theorem I.3.1 and conditions (i), (ii); and this completes the proof 


of the theorerm. 
We note that we have proved the following statement: 


III.1.3. If E is a cyclic group of order m=2*, 1<y, if Cis a 
homomorphism of the group G into the group of automorphisms of E such that 
some elements in G are mapped by C upon the inversion of E, if G is C-complete 
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(satisfies (i) to (iv) of Theorem I11.1.2), and if the subgroup W of G contains Ge, 
does not contain elements z inducing the inversion in E, though G is generated 
by adjoining z to W, then every C-character of W is induced by a C-character of G; 
and if f is a C-character of W, v an element in E, then there exists a C-character 
of G which induces f in W and maps z upon v if, and only if, v'-* =f(x)f(zxz) 
forxin W. 


III.2. Determination of the subgroups H(p). Throughout this section we 
shall make use of the notations (II.3.*). For the enunciation of the next theo- 
rem it will be convenient to introduce the subgroup Hi(p) =(Gep)?””(Gep)’ 
where X’ is the commutator subgroup of the group X. Clearly Hi(p) is the 
smallest subgroup of Gc, such that Gc,/Hi(p) is an abelian group the orders 
of whose elements are divisors of p™?) and H,(p) is a normal subgroup of G. 


THEOREM III.2.1. (a) Ai(p) SH(p) SGep. 

(b) If Cp is not singular, then H(p)/Hi(p) is generated by the elements 
g~ixgx~* for g in G/Hi(p) and x in Gc,/Hi(p). 

(c) If C2 ts singular, then H(2)/Hj,(2) is generated by the elements g-'xgx-9 
for g in G/H,(2), x in Ge2/Hi(2) and by the squares of those elements in G/H,(2) 
which induce the inversion in E(2). 


Proof. It is a consequence of the definition of H(p) that G/H(p) is Cp-com- 
plete. We infer H(p) SGc, from Lemma I.2.3; and we deduce from Theorem 


III.1.2 the validity of the following conditions: 

(1) Ge,/H(p) is an abelian group the orders of whose elements are di- 
visors of p™??, 

(2) g-'xg=<x* for gin G/H(p) and x in Gce,/H(p). 

(3) 2*=1, if C2 is singular, and if the element z in G/H(2) induces the in- 
version in E(2). 

Condition (1) implies H,(p) S H(p). 

Now we define the subgroup H2(p) between Hi(p) and H(p) as follows: 
If Cp is not singular, then H2(p)/Hi(p) is generated by the elements g-'xgx~? 
for g in G/Hi(p) and x in Gc,/Ai(p). If, however, C2 is singular, then 
H,(2)/H,(2) is generated by the elements g-'xgx-* for g in G/H,(2), x in 
Ge2/H,(2) and by the squares of those elements in G/H;(2) which induce the 
inversion in E(2). From conditions (2) and (3) it is immediately deduced 
that H2(p) SH(p). 

It is an immediate consequence of the definition of the subgroup H2(p) 
and of Theorem III.1.2 that the group G/H2(p) is Cp-complete. Thus it fol- 
lows from Lemma I.3.3 that H(p) SH2(p), proving the desired equality of 
H(p) and H;(p). 

We denote by p”"?) the order of the subgroup of all those elements in E(p) 
which are left invariant by every automorphism of E(p) that is induced by an 
element in G. Thus the integer corresponding to the element g in G under Cp 
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is of the form g=1+/g) modulo p™”) for go a suitable integer. From the 
facts enumerated in II.1 one readily deduces the following two statements: 
J(p) =0 if, and only if, m(p) =0 or G/Gc, is not of order a power of p. 
1SJ(2) whenever 1 Sm(2). 
For the investigations concerning the principal genus it is important to 
determine the subgroup H*(p) of all those elements in G which are mapped 
upon 1 by all the Cp-characters of G mapping Gc upon 1. 


LemMa~ III.2.2. (a) H*(p) = H(p)Ge. 

(b) If Cp is not singular, then H*(p)/Gco=(Gep/Gc)”™. 

(c) If C2 is singular, then H*(2)/Ge is generated by the elements in 
(Ge2/Gc)? and by the squares of those elements in G/Gc which induce the inver- 
sion in E(2). 


We note that J(2)=1, if C2 is singular. 

Proof. It is an immediate consequence of the definition of H*(p) that 
H(p)GcSH*(p). Since Gc is mapped upon 1 by the principal genus, and 
since there exists to every element not in Gc, a principal Cp-character which 
does not map it upon 1, it follows that H*(p) Gc». If w is an element in Gep, 
though not in H(p)Gc, then we infer from the Cp-completeness of G/H(p) 
and from Theorems I.3.1 and I.1.4 the existence of a Cp-character of G which 
maps H(p)G¢ upon 1, though it does not map w upon 1. Thus w cannot belong 
to H*(p) and hence H*(p) = H(p)Ge. 

G/Ge is essentially a group of automorphisms of a cyclic group. Hence 
G/Gc is an abelian group. It is an immediate consequence of the definition 
of H*(p) that G/H*(p) is Cp-complete; and thus it follows from Theorem 
III.1.2 that 

(1) the orders of the elements in the abelian group Gc,/H*(p) are di- 
visors of p™?), 

(2) x=g-'xg=x* for x in Gc,/H*(p) and g in G/H*(p), 

(3) s?=1, if 1<m(2) and if the element z in G/H*(2) induces the inver- 
sion in E(2). 

From (2) we infer that the orders of the elements in Gc,/H*(p) are di- 
visors of p’‘”), Thus we define the subgroup H**(p) between Ge and Ge, as 
follows: If Cp is not singular, then H**(p)/Gc=(Gep/Gc)”"™. If C2 is singu- 
- lar, then H**(2)/Ge is generated by the squares of elements in G/Ge which 
induce in E(2) the inversion or the identity. From what has been shown thus 
far we deduce H**(p) SH*(p). (The condition (1) may now be discarded, 
since J(p) Sx2(p).) 

If x is an element in Gc,/H**(p) and g an element in G/H**(p), then it 
follows from the construction of H**(p) that G/H**(p) is commutative, that 
the orders of the elements in Gc,/H**(p) are divisors of 7, and that there- 
fore g-'xg =x =x* for x in Gc,/H**(p) and g in G/H**(p). Hence by Theorem 
III.1.2 we have the Cp-completeness of G/H**(p), proving H*(p) = H**(p). 
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THEOREM III.2.3. The principal genus is exactly the set of the C-characters 
mapping Gc upon 1 if, and only if, 

(1) Cas not singular, and 

(2) [Gep:Gc] is prime to p for every prime p such that m(p) #0 and G/Gcy» 
is of order a power of p. 


REMARK. J(p) #0 if, and only if, m(p) #0 and G/Ge, is of order a power 
of ~, as follows from the remarks in IT.1. 

Proof. Suppose first that the principal genus contains every C-character 
mapping Ge upon 1. Then it follows from Theorem II.3.2 that (1) is satisfied 
by G and that Gc, = H(p)Gc. Hence we may deduce from Lemma III.2.2(a) 
that Gc,=H*(p), and from Lemma III.2.2(b) that Ge,/Ge=H*(p)/Ge 
Consequently J(p) #0 implies that the order of Ge»/Ge is 
prime to p; and this shows the necessity of condition (2), since J(p) <0 if, 
and only if, m(p) #0 and G/Gce, is of order a power of p. 

If conversely conditions (1) and (2) are satisfied by G, it follows from 
Lemma III.2.2(a) and (b) and the above Remark that (GceH(p))/Ge 
= H*(p)/Ge=(Gep/Ge)” ” =Gep/Ge. Thus GcH(p) =Gcp; and we infer from 
Theorem I1.3.2 that the principal genus contains all the C-characters, mapping 
Gc upon 1. 

III.3. The case of composite E. We remind the reader of the fact that 
C-completeness of the group G is equivalent to the property: the cross-cut 
of all the subgroups H(p) is 1. This latter property implies that the cross- 
cut of H(p) and H(p) is 1, for every p, since H(p) is the cross-cut of all the 
H(q) for gp. The importance of the last criterion rests on the fact that H(p) 
consists of the elements mapped into E(p) by every C-character of G. 


THEOREM III.3.1. Each of the following properties implies the others. 

(a) Gis C-complete. 

(b) H(p) does not contain elements of order p in Gc("*). 

(c) Ge is abelian and the cross-cut of H(p) and Ge consists of the elements of 
order prime to p in Go("*). 


Proof. If y is an element of order p in Ge, then y belongs to H(p), since 
C-characters of G induce Co-characters of Gc in E. If G is C-complete, then 
the cross-cut of H(p) and H(p) is 1 so that elements of order p in Ge cannot 
be in H(p). If G is not C-complete, then the cross-cut H of the subgroups 
H(p) is different from 1 and is part of Gc so that there exists an element w 
of prime order 7 in H. This element w in Gc belongs to H(r), since it belongs 
to H; and thus we have shown the equivalence of (a) and (b). 

If G is C-complete, then Ge is abelian by Theorem I.3.1 so that Ge is the 
direct product of groups P and Q where P is of order a power of p and Q of 


('*) Conditions (b) and (c) are supposed to be valid for every prime ». 
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order prime to p. It is a consequence of (b) that the cross-cut of H(p) and P 
is 1; and it is a consequence of Theorem III.2.1 that Q is part of H(p), that is 
Q is the cross-cut of Gc and H(p). That (b) is a consequence of (c) is obvious; 
and this completes the proof. 


Coro.tuary III.3.2. G is C-complete if, and only if, H*(p) is, for every 
prime p, the direct product of H(p) and the p-component of the abelian sub- 
group Ge. 


Proof. If G is C-complete, then Ge is abelian by Theorem III.3.1(c). If Ge 
is abelian, then it is the direct product of two subgroups U(p) and V() 
where U(p) consists of the elements of order a power of p in Gc and where 
V(p) consists of the elements of order prime to p in Gc. Since C-characters 
of G induce Co-characters of Ge, it follows that V(p) S H(p). Hence we deduce 
from Lemma III.2.2(a) that H*(p) = H(p) U(p). If G is C-complete, then it 
follows from Theorem III.3.1(b) that H*(p) is the direct product of H(p) 
and U(p). If conversely H*(p) is the direct product of H(p) and U(p), then 
the C-completeness of G is a consequence of Theorem III.3.1(b). 

The criteria for C-completeness contained in Theorem III.3.1 and Corol- 
lary III.3.2, though sufficient for many applications, are unsatisfactory, since 
the subgroups H(p), H*(p) which are defined in terms of C-characters are in- 
volved, whereas a satisfactory criterion should involve nothing but structural 
properties of G and C. We note that Theorem III.3.1(2) is really a family of 
conditions, namely one for every prime p. To solve our problem it is both 
sufficient and advantageous to find for every preassigned prime #, a criterion 
assuring the absence("*) of elements of order p in the cross-cut of H(p) and Ge. 
The following concept will be used in the enunciation of such a criterion. 

(III.3.*) Q=Q(p) is the set of all the elements in Gc, whose order is 
prime to p. 


THEOREM III.3.3. If Gc ts abelian(?"), and if p is a prime, then the following 
conditions are necessary and sufficient for the absence("*) of elements of order p 
tn the cross-cut of H (b) and Gc: 

(1) Q(p) és a normal subgroup of G and Gc,/Q(p) is an abelian group of 
order a power of p. 

(2) If y is an element in G and x an element in Gc, such that xy~'x~y is 
of order a power of p, then (xy~ xy)?" =1; and if furthermore x?” 
ts an element of order a power of p in Gc, then(#*) yxy =x". 


(*) This is property (b) of Theorem III.3.1, stated for one prime number only. 

(!7) Theorem III.3.1 (c) shows that the commutativity of Gc is necessary for the com- 
pleteness of the group G. 

(#8) Here the exponent y stands for a suitable integer which is modulo p"‘») congruent to 
the integer corresponding to the element y under Cp. It can be shown, however, that the order 
of the element x is a divisor of p*‘». 
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(3) Suppose that p=2, 1<m/(2) and that the element y in G induces the in- 
version in E(2)(*). 

(3’) If y? is an element of order a power of 2 in Gc, then y?=1. 

(3’’) If g is an element in G such that the integer corresponding to g under 
C2 is congruent to —1 modulo 4, if x is an element in Geez such that x, g~'xgx-" 
and are of order a power of 2, then(?°) g—xgx—! = 


Proof. A. Assume that there are no elements of order in the cross-cut 
of H(p) and Ge. 

We infer from Lemma I.2.3 that H(p) SGc,; and from the definition of 
H(p) that G/H(p) is Cp-complete. Thus it follows from Theorem III.1.2 that: 

(i) Gc,/H(p) is an abelian group the orders of whose elements are di- 
visors of p™?); 

(ii) for v in G/H(p) and x in Gc,/H(p); 

(iii) 22=1 for z an element in G/H(2) inducing the inversion in E(2) pro- 
vided p=2 and 1<m(2). 

As an abelian group Gc is the direct product of subgroups P and K where 
P consists of all the elements of order a power of p in Gc and where K con- 
sists of all the elements of order prime to p in Ge. Since P and K are charac- 
teristic subgroups of Gc, they are normal subgroups of G. We infer from (i) 
that K S H(p); and from our hypothesis, that the cross-cut of H(p) and P is 1. 
Hence K is the cross-cut of H(p) and Ge. 

From (i) and the commutativity of G/Gg¢ it follows that the commutator 
subgroup Gc, of Gey is part of the cross-cut K of H(p) and Ge. Hence 
Gc,/K is abelian and is therefore the direct product of groups S and T where 
S consists of all the elements of order prime to p whereas T consists of all the 
elements of order a power of p. Denote by S* the subgroup of Gc, which 
contains K and satisfies S=S*/K. The orders of the elements in S* are 
prime to p, since this holds true for the orders of the elements both in K and 
in S*/K, that is S* <Q(p). If r is an element in Q(p), then its order is prime to 
p and it is contained in a coset of Gce,/K whose order is prime to ), that is r is 
in a coset belonging to S; and thus we have shown that Q(p) =.S*. Thus Q(p) 
is a subgroup of Gcp; it is therefore a characteristic subgroup of Gc, and a 
normal subgroup of G. Since KSQ(p) and Ge,/K is abelian, Gcep/Q(p) is 
abelian of order a power of p, proving the necessity of condition (1). 

If y is any element in G, then we also denote by y some integer that corre- 
sponds modulo p™) to the element y under Cp. From the definition of J(p) 
we infer the existence of an integer yo such that y=1+p" yo. 

If y is an element in G and x an element in Gey», then we infer from (ii) that 


(*%) This condition is vacuous, if none of the elements in G induces the inversion in E(2). 

(2°) The exponent (1—g)2~! stands for any suitable integer meeting the obvious require- 
ments; it is a consequence of the first part of condition (2) that the value of (y~xyx—)¢-0 
is independent of the particular choice of (1 —g)271. 


{ 
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is an element in H(p). If both y—'xyx-! and x?” are elements of order a 
power of p in Gc, then y~'xyx~" is an element of order a power of p in the 
cross-cut of H(p) and Ge so that y~'xyx-¥=1, proving the necessity of the 
second part of condition (2). If we do assume only that y—'xyx-' is an element 
of order a power of p (in Gc), then we infer from (i) and the above equation 
that 


so that (y—xyx-!)»"”~ is an element of order a power of p in the cross- 
cut of H(p) and Gc. But such an element is equal to 1, completing the proof 
of the necessity of (2). 

Suppose now that p=2 and 1<m(2), that the element y in G induces the 
inversion in E(2) and‘ that y? is an element of order a power of 2 in Gc. Then 
it follows from (iii) that y? is an element of order a power of 2 in the cross-cut 
of Gc and H(2), that is y?=1, showing the necessity of (3’). 

If p=2, 1<m/(2), if y induces the inversion in E(2) and if the integer cor- 
responding to the element g in G under C2 is congruent to —1 modulo 4, 
then (1—g)2—' is an odd integer. If g-'xgx-! and y~—'xyx—' for x an element 
in Gc: are elements of order a power of 2 (in Gc), then we infer from the com- 
mutativity of Gce2/H(2) (see (i)!) that 


= = 1 modulo H(2) 


by (ii); and thus g—xgx-"(y—xyx-!) @-2 = 1 as an element of order a power 
of 2 in the cross-cut of H(2) and Ge, proving the necessity of (3’’). 

B. Assume that the conditions (1), (2), (3) are satisfied by G, C, p. We 
distinguish two cases. 

B.1. Cp is not singular. We select a p-Sylow subgroup M of Gc». The 
cross-cut of Q(p) and M is 1; and M contains the subgroup P of all the ele- 
ments of order a power of p in Ge, since P is a normal subgroup of G. Since 
any two p-Sylow components of Gc, are conjugate in Gc», and since every 
element in G transforms M into another p-Sylow subgroup of Gp, it follows 
that every coset of G/Gc» contains elements from the normalizer of M in G. 
It follows from the special hypothesis B.1 that G/Gc, is a cyclic group; and 
thus we have shown the existence of an element g in G which generates G 
modulo Gc, and which transforms M into itself. 

It is a consequence of condition (1) that Q(p) is a normal subgroup of G, 
and that every coset of Gc»/Q(p) contains one and only one element in M. 
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Thus M and Gc,/Q(p) are isomorphic groups and we infer from (1) the com- 


mutativity of M. 

If we denote by N the subgroup of G which is generated by adjoining the 
element g to M, then M is a normal subgroup of N, every coset of N/M is 
represented by some power of g, and all the elements in the same coset Mg‘ 


of N/M induce in M the same automorphism. 


G=NGec, 


H*(p) = H(p)Gc= H(p) XP 


K =Gcl\H(p) 


1=MAH(?) 


We put From Q(p)=Q(p)?"® and from the com- 
mutativity of M (together with the fact that every coset of Gc,/Q(p) contains 
one and only one element in M) we infer that 


ot = 


Clearly it follows from (1) that Ge,/Q*(p) is an abelian group the orders of 
whose elements are divisors of p™”. It is readily seen now that Q*(p) is 


N Gcep= MQ(P) 
\M =P Q 
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exactly the group H;(p) introduced in Theorem III.2.1; and we infer from 
this theorem that H(p) is the uniquely determined subgroup of Gc, which 
contains Q*+ such that H(p)/Q* is exactly the subgroup of Gc,/Q* which is 
generated by all the elements v—wvw~ for v in G/Q* and w in Gce,/Q*. 
Since Gc,/Q* is by (1) an abelian group the orders of whose elements are 
divisors of p™”), it follows that every coset V of G/Gc, induces a well-deter- 
mined automorphism in Gc,/Q*+ which we denote by V also. Thus o—wow~* 
=w’—* for v in G/Qt, w in Gc,/Qt and V=Ge,v. From our choice of the ele- 
ment g we infer the existence of an integer 4 such that V = Ge pg‘; and for the 
integers corresponding under Cp to these elements it follows likewise that 
v=g‘ modulo p™). If we put Z=Gepg, then we find for v in G/Q* and w in 


Ge,/Qt: 


From this equation one readily deduces the following fact: 
H(p)/Q* = 


Now let g stand for any integer which modulo p””) corresponds under Cp 
to the element g in G. If x is an element in H(p), then we infer from (’’) the 
existence of an element Y in Gc,/Q* such that Qtx =Q+ Y2-*. But Y contains 
an element y in M. Hence it follows from (’) that x is in 


Thus we have proved the following statement. 
(’’’) Every element in H(p) has the form: 


pm) —1 
2 


X= XX, Xogx for 29 in Q(p), x: and x, in M. 


We note that by the choice of g the element g~'x2g is in M too so that x7'x be- 
longs to M. 

Suppose now that x is an element of order a power of p in the cross-cut . 
of H(p) and Ge. Then it follows from (’’’) and the remark following (’’’) 
that both x and xs'x belong to M, since PSM. Hence x» is in the cross-cut 
of Q(p) and M so that x»=1. Since g~'xegxz"! is in the cross-cut P of M 
and Ge, it follows that x(g—xegxz!)-'=x7""x}" is in P too. (Use the 
commutativity of M.) From the definition of the integer g and the invariant 
J(p) we infer the existence of an integer g* such that 1—g =g*p"‘”); and since 
the element g generates G modulo Ge», that is the automorphism correspond- 
ing to g under Cp generates the group of automorphisms of E(p) induced by 
elements in G, it follows that g* is prime to p. Hence 


p™)—J(p) g* pir) 1-9 
(x1 X2 ) =% 
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is in Ge and even in P; and it follows from condition (2) (second part) that 


ge p™P)—J(p) g 
(% )g = (m1 %2) » 


—1 


and consequently we find that 


(g xg 
p™ (p)—J (Pp) —1 pr) —1 


= (g g) = (xg g 
= 1 


as follows from the first part of condition (3). But g* is prime to p; and hence 
it follows that x =1; and we have shown that the only element of order a 
power of p in the cross-cut of H(p) and Ge is 1, as we desired to do. 

B.2. p=2 and C2 is singular. Then J(2) =1, as has been pointed out be- 
fore. 

We select a 2-Sylow subgroup N of G and denote by M the cross-cut 
of N and Ge2. Since Ge: is a normal subgroup of G, it follows that M is a 
2-Sylow subgroup of Gee. Since both Ge2/Q(2) (by (1)) and G/Ge: are of order 
a power of 2 (see II.1.B), it follows that every coset of G/Q(2) contains one 
and only one element in N; and we infer from (1) the commutativity of M, 
as M and Ge2/Q(2) are isomorphic. 

We denote by Y the set of all the elements in G which induce the inver- 
sion in E(2). Clearly Y is a coset of G/Gce2 and the cross-cut Y* of Y and N 
is a coset of N/M. 

We denote by Q+=@Q*(2) the subgroup Ge”; and as in B.1 it follows 
that Q*(2) =Q(2) and that Ge2/Q*(2) is an abelian group the orders of 
whose elements are divisors of 2". Then one deduces from Theorem III.2.1 
that 

(+) H(2) is the uniquely determined subgroup of Gcz: which contains 
Q+(2) such that H(2)/Q*(2) is generated by the elements y? for y in Y and 
by the elements v~'wvw~* for v in G/Q*(2) and for w in Ge2/Q*(2). 

It will be convenient to start with the investigation of a (possibly smaller) 
subgroup, namely the subgroup Q° between Q*+(2) and H(2) which is uniquely 
determined by the property that Q°/Q* is generated by the elements y? for y 
in the coset Y of G/Gce [consisting of all the elements in G/Q+ which induce 
the inversion in E(2)]. Noting that every coset of G/Q(2) contains one and 
only one element in N and that the inverses of elements in Y belong to Y, 
it is readily verified that every element x in Q° has the form: 


x =% (g xegxe ) 
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m (2) 
= xox, for 0 S k, x inQ, x; in M, y; in Y*. 


If 1<k, then it is possible to simplify this normal form for elements in Q° 
as follows: Since every y: is an element in N which induces the inversion in 
E(2), all the elements y; are in the same coset Y* of N/M; and hence there 
exist elements 2; in M such that y;=2,j1. Since all the elements in Y* induce 
in the abelian group M the same automorphism which we denote by Y%*, it 
follows that y? =2!*""y?. Since z; and y} are elements in the commutative 
group M, we deduce that y? - - - yz=(z, - - - %)!+¥*y?*. Considering that an 
odd power of an element in Y* is an element in Y*, we see finally that every 
element x in Q° has the form: 


oh 
(*) X= Xs 


for xo in Q, x; and x2 in M, where x3;=1 if h=0 and x; is in Y* if 0<h. 

Now we distinguish two cases: 

B.2.1. G/Gee2 is of order 2. Then the elements in G induce in E(2) the in- 
version and the identity; and it follows from (+) and (*) that every element 
x in H(2) has the form given in (*). Suppose now that such an element x 
belongs to P (is an element of order a power of 2 in Gc). Since PSM, since 
M is an abelian 2-group, and since x¢'!x and x both belong to M, it follows 
immediately that x»=1. We note furthermore that x¥*—! belongs to the cross- 
cut P of Ge and M, since it belongs to the commutator subgroup of G which 
is part of Ge. 

If h=0, then and consequently x2 
=x x} "is an element in P. Hence it follows from the second part of condition 
(2) that = or and thus we find 
that 


2m(2)_ 4.1) gm(2)-1 (1—¥*) 2m@)—J(2) 
x= = = 


by the first part of condition (2). 
Next if then 


14¥* 2 2m(2) 2 2 


gm(2)—1 2 
= (x1 


and this shows that x7" ‘xx; is an element in Y* whose square is in P. Thus 
it follows from condition (3’) that this square is 1 so that x =x@-¥)2"°7® 
=1 by condition (2) (first part). 

If finally 1<h, then 2*-!—1 is odd; and we find that 


gm(2)-1 2 (2-1_1)2 
x= (x1 X2%X3) X3 
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Since both x¢@"-” and x7"” ‘xaxs are elements in Y*, there exists an ele- 
ment z in M such that 2x7"° 'xsx;=2x}-""; and consequently we have 
x=" l-M+l" The first of the factors is 1 as a consequence of the 
first part of condition (2); and hence x = 1 is a consequence of the second part 
of (2). This completes the proof of the fact that in the case under considera- 
tion the cross-cut of P and H(2) is 1. 

B.2.2. G/Ge2 ts not of order 2. Then we infer from II.1.B and from the 
existence of elements in G which induce the inversion in E(2) that there 
exists a basis Z, Y of G/Gc2 where Y induces the inversion in E(2) and where 
the integer corresponding to Z under C2 is congruent to —1 modulo 4. We 
note that the cross-cuts Z* and Y* of Z and Y respectively with N are cosets 
of N/M which induce the automorphisms Z* and Y* respectively in the com- 
mutative group M. 

Now one derives from (+) and (*) with the same arguments as were 
used in B.1 that every element x in H(2) has the form: 


( ) x = 


where xp is in Q(2), where x1, x2 and x3 are elements in M and x, is an element 
in Y* if 0<h, and x,=1 if 0=h, and where g is an integer which is modulo 2" 
congruent to the integers corresponding under C2 to the elements in Z*. 

If x is an element if the cross-cut of H(2) and P, then it follows as usual 
that x»=1. It follows from condition (3’’) and the first part of condition (2) 
that = 1 and that = 1; and thus we find that 


14¥* (1+¥*)(1—g)2-1 


If xi’ =1, then x is the (1+ Y*)th power of some element in M, and x=1 
is a consequence of the sevond part of condition (2). 

If h=1, then x =v!+¥*x3 = (vx,)? for v an element in M and ox, an element 
in Y*; and it follows from condition (3’) that x =1. 

If 1<h, then x for some element w in M (the 
existence of w is seen as in case B.2.1) and we infer again from the second part 
of condition (2) that x =1. Thus it follows that the cross-cut of P and H(2) 
is 1; and this completes the proof of the theorem. 

Remark. From condition (2) one derives readily that x°"” =1 for every 
element x in Gc, such that x?” is an element of order a power of p in Ge. 


CHAPTER IV. UNIQUENESS OF THE C-CHARACTER GROUP 


The group B of C-characters of the group G shall be termed complete, if 1 
is the only element in G mapped upon 1 by every C-character in B; and the com- 
plete group B of C-characters of G may be called strictly complete, if B con- 
tains the principal genus. The C-character group of G (that is, the group of 
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all the C-characters of G) is obviously complete if, and only if, G is C-complete. 
Thus there arises the problem(*") of determining conditions assuring that the 
C-character group of G is the only complete (the only strictly complete) group 
of C-characters of G; and we devote this chapter to a solution of this problem. 

IV.1. Criteria depending on the nature of the homomorphism C only. If G 
is C-complete, then G is by Theorem I.3.1 an extension of the abelian group 
Gc by the abelian group G/Gc; and the homomorphism C effects an isomor- 
phism of G/G¢ into the group of automorphisms of the cyclic group EZ. The 
criteria in this section will depend (like the investigations of Chapter II) only 
on the nature of the isomorphism C of the quotient group G/Gc. 


THEOREM IV.1.1. If the group G is C-complete, if the order of Gc,/Gc ts 
prime to p whenever 0<m(p) and G/Gc, ts of order a power of p, and if C2 is 
not singular, then the C-character group of.G is the only strictly complete group 
of C-characters of G. 


Proof. If B is a complete group of C-characters of G, then it follows from 
the C-completeness of G and from Theorems I.3.1 and I.1.4 that every 
Cy-character of Gc in E is induced by some C-character in B. Thus there exists 
to every C-character f of G at least one C-character 5 in B such that f and } 
coincide on Ge. Since fb is a C-character of G which maps Ge upon |, it 
follows from our hypotheses and Theorem III.2.3 that fd—'! belongs to the 
principal genus. If B is strictly complete, then fd is in B, showing that f 


itself belongs to B, that is, every strictly complete group B of C-characters 
of G is the (full) C-character group of G. 


THEOREM IV.1.2. If G is C-complete, if C is regular, then the C-character 
group of G ts the only complete group of C-characters of G. 


Proof. It is an immediate consequence of Corollary II.5.3 and our hy- 
pothesis that G and its (full) C-character group are of equal order N. Suppose 
now that B is a complete group of C-characters of G. Then its order N* cer- 
tainly does not exceed N [is a divisor of N]. 

If g is an element in G, then we define a single-valued B to E function(??) 
F,(f) by the. rule F,(f) =f(g) for every f in B. It is readily verified that F, is 
a Co-character of B in EZ; and we note that B is an abelian group the orders 
of whose elements are divisors of the order m of E. The elements x and y 
in G satisfy F,=F, if, and only if, f(x) =f(y) for every f in B; and this is 
equivalent to 


(#) A solution of this problem is important by its applications to the theory of radical 
extensions of fields, as will be shown elsewhere. 
(#) Cp. V.1 below for a more detailed study of this operator F,. 
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for every f in B. Since B is a complete group of C-characters of G, it follows 
that F, =F, if, and only if, xy-'=1 or x=y. Thus we have shown that 
the set Fg of all the Co-characters F, of B consists of exactly N characters. 
It is a consequence of Theorem I.1.3 that the order of the group of all the 
Co-characters of B in E is exactly N* showing that NS N*. But we have 
pointed out before that N* does not exceed N showing that N=N’*, that is, 
B and the (full) C-character group of G are of equal order and are therefore 
equal, as was to be shown. 

We note that the methods employed in the proofs of the two preceding 
theorems are completely exhausted by these theorems. 

The following lemma will prove useful in the future. 


LemMA IV.1.3. If G is C-complete, if G* is a subgroup of G such that 
G=GG*, if B is a group of C-characters of G such that every C-character of G* 
is induced by some C-character in B, and if to every element w in Gc whose order 
modulo(**) G* is a prime there exists a C-character in B which maps G* but not w 
upon 1, then B is the full C-character group of G. 


Proof. If f is a C-character of G, then there exists a C-character } in B 
such that f and bd coincide on G*. Thus f;=fb—'! is a C-character of G which 
maps G* upon 1. If B* is the group of all the C-characters in B which map G* 
upon 1, then it follows from our hypotheses that G¢* is the set of all the ele- 
ments in Ge which are“mapped upon 1 by all the characters in B*. Hence it 
follows from Lemma I.3.4 and Theorem I.1.4 that every Co-character of Ge 
in E which maps G¢* upon 1 is induced by one and only one C-character in B*; 
and thus it follows that the character f;, constructed above, belongs to B*. 
Hence f=fib belongs to B; and this completes the proof. 

IV.2. The case of primary E and cyclic G/Gc. Throughout this section 
we impose the following hypotheses(*) : 

(IV.2.*) E is a cyclic group of prime power order m = p*, C is a homomor- 
phism of the group G into the group of automorphisms of £, and G is C-com- 
plete. 
These hypotheses imply the cyclicity of G/Ge, if m is odd, but not in the 
case of even m. Thus we add the following requirement: 

(IV.2.**) G/Ge is cyclic. 

THEOREM IV.2.1. The C-character group of G is the only strictly complete 
group of C-characters of G if, and only if, C is not singular. 


Proof. If the condition of the theorem is satisfied, then the hypotheses of 
Theorem IV.1.1 are satisfied too, showing the sufficiency of our condition. 


(3) More precisely, modulo the cross-cut of G* and Ge. 
(*) Without restating them in the hypotheses of the various theorems of this section. 
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If the condition is not satisfied, then p=2, 1 <y and there exists in G an ele- 
ment z inducing the inversion in E. Because of the cyclicity of G/G¢ it follows 
that z generates G modulo Gc; and z?=1 is a consequence of the C-complete- 
ness of G, as follows from Theorem III.1.2. We denote by B the group of all 
those C-characters of G which map z upon an element in E*. From v!-*=v? 
for v in E we infer that B contains the principal genus. From Lemma I.3.4 and 
z*=1 we deduce that every Co-character of Gc in E is induced by one and only 
one C-character of G which maps z upon 1; and this implies that every 
Co-character of Gc in E is induced by some C-character in the group B. Hence 
it follows from the C-completeness of G and Theorems I.3.1 and 1.1.4 that 
there exists to every element y#1 in Gc a C-character in B which does not 
map y upon 1; and if y is an element in G though not in Ge, then y is mapped 
upon elements different from 1 by the principal genus which is part of B. 
Thus B is a strictly complete group of C-characters of G. But we infer from 
Lemma I.3.4 and z?=1 the existence of a C-character of G which maps Ge 
upon 1 and the element z upon an element not in E?, that is, B is not the full 
C-character group of G as was to be shown. 


THEOREM IV.2.2. The C-character group of G is the only complete group of 
C-characters of G if, and only if, one of the following (mutually exclusive) condi- 
tions is satisfied: 

(1) If p is odd and G#Ge, then either m =p or the order of G/Gc is divisible 
by p. 

(2) If p=2, G/Ge of order 2, and if C is irregular, then G2 <G’. 


Proof. If G=Gc, then the C-characters of G are Co-characters of G in E; 
and it follows from Theorem I.1.4 that the character group of G is the only 
complete group of characters. Consequently we assume in the following that 
G and G¢ are different. This implies in particular that m is neither 1 nor 2. 

If C is regular, then we infer from Theorem IV.1.2 that the C-character 
group of G is the only complete group of C-characters of G. Consequently we 
assume in the following that C is irregular. ;, 

If p=2, and if there exists in G an element z inducing the inversion in E, 
then G/G¢ is of order 2, since it is cyclic; and G? = G3, since every element in G, 
not in Gc, induces the inversion in E, and since it follows from the C-complete- 
ness of G and Theorem III.1.2 that y?=1, if y induces. the inversion in E. 
On the other hand we infer from Theorem IV.2.1 that the C-character group 
of G is not the only complete group of C-characters of G. Consequently we 
may assume in the following that G does not contain an element inducing the 
inversion in E, if p=2. 

Since G/G¢ is cyclic, there exists an element g in G which generates G 
modulo Ge. If p is odd, then the automorphism induced by g in E is not of 
order a power of p; and we infer from II.1.A that the integer g—1 is prime to 
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p. The automorphism g of E does not leave invariant any element different 
from 1; and hence it follows from thé C-completeness of G and from Theo- 
rem III.1.2 that 1 is the only element in Ge permuting with g. This implies 
in particular that the element g and the automorphism g of E are of equal 
order n. 
If p=2, then the integer g corresponding to g under C is of the form 
g= —1+2%g’ modulo m for g’ an odd number and 1 <j <u, since the integer 
corresponding to a square in G cannot be congruent to —1 modulo 4, and 
since G does not contain an inversion of E. We infer from II.1.B that the auto- 
morphism g of E is of order 2*~’. Since the automorphism g of E leaves only 
the elements in E whose order is a divisor of 2 invariant, it follows from the 
C-completeness of G and from Theorem III.1.2 that g permutes only with 
those elements in Gc whose orders are divisors of 2; and this implies in par- 
ticular that the element g*=g”” in Ge is either 1 or of order 2. 

We distinguish several cases. 

Case 1. If p is odd, then condition (1) is satisfied by G; and if p=2, then the 
order of G/Gc is divisible by 4. We prove some lemmas. 


(1.A) Every complete group B of C-characters of G contains a character b 
such that b(g*) =1 implies g*=1. 


The complete group B of C-characters of G induces in the cyclic group 


{g} generated by the element g a complete group B of C-characters. If p is 
odd and m=), then we infer from Theorem II.5.2 that the C-character group 
of {g} is a cyclic group of order p which is clearly generated by every element 
not 1. Since B cannot be 1, there exists in B a character b such that b(g) ¥1 
and this character 6 generates the full C-character group of {g} and has 
therefore the desired qualities. Assume now that mp. Then there exists a 
divisor m’ of the order of the automorphism g of E such that g’’=g”’ is an 
automorphism of order p of E. It is a consequence of II.1 that the integer 
corresponding to g’’ under C is of the form g’’=1+ p*~'g’"’ modulo m for 
g’’’ an integer prime to p. There exists in B a character b such that b(g’’) #1; 
and if p=2, but g* #1, then b may and shall be required to satisfy b(g*) ¥1. If 
p=2, g*=1, then g’’ is the only element of order 2 in { g}; and if p=2, g*¥1, 
then g* is the only element of order 2 in {g}; and thus p=2 and b(g‘) =1 
imply that g‘=1. If p is odd, then {g} c=1, as has been pointed out before; 
and it is a consequence of Theorem II.3.2 that every C-character of {g} be- 
longs to the principal genus. Hence there exists an element v in E such that 
b(g*) =v'-*". Since 1b(g’’) =v-»*"'#’”’, it follows that v is of order p* and that 
therefore v generates E. If 1=5(g*) for some i, then g‘ consequently leaves 
every element in E invariant; and this implies g‘=1, since the element g in G 
and the automorphism g of E have the same order n. 
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(1.B) If the C-character d of {g} does not map any element except 1 upon 1, 
then every C-character of {g} is a power of d. 


If m=, then the C-character group of { g} is by Theorem II.5.2 a cyclic 
group of order ~ which is generated by each of its elements excepting 1. If 
mp, and if g*=1 in case p=2, then it follows from Theorem II.3.2 that 
every C-character of { g} belongs to the principal genus, that therefore 
d(g‘) =v'-*' for v in E, and that v is an element of order m in E, since 1d(g’’) 
for g’’=g”’ an automorphism of order p of EZ; and it is readily seen that every 
C-character of {g} is a power of d. (Note that g’’=1+p*—1g’"" modulo m for 
g’’’ an integer prime to p.) If p=2™m, and if g* 1, then(*) one verifies in 
the same way that d? generates the principal genus of {g} and that therefore 
every C-character of {g} is a power of d, since it coincides with d or d? on g*. 

Suppose now that B is a complete group of C-characters of the group G. 
Then we denote by By the group of all those C-characters in B which map g 
upon 1. 

Consider an element w in Gc whose order modulo { g} is p. If p is odd then 
the cross-cut of {g} and Gc is 1 so that w is an element of order p in Ge. If 
p =2, then the cross-cut of { g} and Ge is either 1 or generated by the element 
g* of order 2; thus w is either of order 2 or w* = g* and w is of order 4. 

It is a consequence of (1.A) that there exists in B a C-character b such 
that b(g*) =1 implies g‘=1; and we deduce from (1.B) that every C-character 
of {g} is induced by some power of 0. 

If b(w) =1, then we deduce from the completeness of B the existence of a 
C-character f in B such that f(w) #1; and there exists (by the remark made 
in the preceding paragraph) an integer k such that f and 6* coincide on {g}. 
Clearly f:=fb-* belongs to By and satisfies f;(w) =f(w)b(w)-* =f(w) #1, show- 
ing that w is not mapped upon 1 by Bo. 

If b(w) #1, but w is of order p, then b(w) is of order p, since b induces a 
Co-character in Gc. There exists an integer r such that b(g") is of order p; 
for if m=p and p is odd, then take r=1; if p is odd, but mp), then choose r 
in such a fashion that g’ induces an automorphism of order p in E, and a like 
choice is appropriate if p=2 and g* = 1; if g* #1, then b(g*) is of order 2. Since 
b(g*) and b(w) are of equal order p, there exists an integer s, prime to p, such 
that b(g") =b(w)*=b(w*); and we have 6(g"w~*) =b(g")b(w)-*=1. From the 
completeness of B we infer the existence of a C-character f, in B such that 
fo(g"w-*) 1; and there exists an integer k’ such that and b*’ coincide on { g}. 
Clearly fs=f:b-" is a C-character in Bo, satisfying =fs(g’w~*) 


(5) Another proof of this fact: from d(g*) =d(g’’)!*0” =d(g’’)* one derives that d is a C-char- 
acter of order m. But it follows from Theorem I1.5.2 that m is exactly the order of the C-charac- 


ter group of the cyclic group {g}. 
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= #1; and from f;(w)-*=f3(w~*) we infer 
f:(w) #1, showing that again w is not mapped upon 1 by Bo. 

If b(w) #1, but w is not of order p, then p=2, g* #1 and w?=g*. Denote 
by g’’ an element in {g} inducing in E an automorphism of order 2. Then 
g*=g'’*, and the integer corresponding to g’”’ is of the form g’’=1+2+-! 
modulo m. Since b(g*) =b(g’"2) =b(g’’)'+#”’ =b(g’")?+?" is of order 2, it fol- 
lows that b(g’’) is of order 4 whereas b(w) is of order 2 or 4 and we find again 
an integer r such that b(w) and b(g*) are of equal order, an odd integer s 
such that b(w)*=6(g"); and we may now show, exactly as in the preceding 
paragraph, that w is not mapped upon 1 by Bo. 

Thus we have shown that all the hypotheses of Lemma IV.1.3 are satisfied 
by the group B; and hence it follows that B is the full C-character group of G. 

Case 2. p=2, the order of G/Gc is 2 and condition (2) is satisfied by G. 
Then the integer g corresponding to the element g in G is of the form 
g=-—1+2*"' modulo m; and g*=g? is an element in Ge such that g**=1. 
Since G? $Gce, and since every element in G, not in Gc, has the form xg for x 
in Ge, it follows from (xg)? =x!+¢g* =x" 'g* that G? is obtained by adjoining 
g* to GZ; and it follows from (2) that g* is not a square in Gg so that in par- 
ticular g* ¥1. 

There exists in every complete group B of C-characters of G a C-character 
b such that b(g*) #1. Since g* is an element of order 2 in Ge, b(g*) is an element 
of order 2; and we deduce‘from b(g*) =b(g?) =b(g)!+# =6(g)”” that b(g) is an 
element of order 2*=m in E, that is b(g) generates E; and this implies in 
particular that every C-character of {g} is induced by a power of b. 

If w is an element in Gc whose order modulo {g} is 2, then w?=1, since 
w*=g* contradicts (2). Clearly b(w) is either an element of order 2, as is 
b(g*), or is 1, as is b(g*); and thus there exists an integer r such that b(g’) 
and }b(w) are of equal order. Hence there exists an odd integer s such that 
b(w)* =b(g"). Now we may show by the same arguments, as used in Case 1 
above, that there exists a C-character in B which maps g upon 1 and w upon 
an element not 1; and we infer from Lemma IV.1.3 that the complete group B 
of C-characters of G is the full C-character group of G. 

This completes the proof of the sufficiency of the conditions of our theo- 
rem. 

Case 3. p is odd, but condition (1) is not satisfied by G. Then 1<y and the 
order of G/Gc¢ is different from 1 and prime to p. It is a consequence of II.1 
that the integers corresponding to elements y, not in Ge, meet the require- 
ment y—1 is prime to p. 

Consider the group B of all the C-characters f of G, satisfying f(g) is in E. 

Since the cross-cut of {g} and Ge is 1, it follows from the C-completeness 
of G and Lemma I.3.4 that every Co-character of Gc in E is induced by one 
and only one C-character in B which maps g upon 1. Hence it follows from 
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Theorem I.1.4 that every element y in G which is not in {g} is mapped upon 

elements, not 1, by C-characters in B, since y=xg‘ for x #1 in Ge. If vis an 

element, not 1, in Z”, then v'~* is a C-character in B satisfying v'-0 #1 for 

g'~1; and thus we have shown the completeness of the group B. 

But B is not the full C-character group of G, since the C-character e!~¢ 
for e a generator of E cannot be in B, as e'~* is not in E”. Thus we have shown 

that condition (1) is a necessary condition. 

Case 4. p=2, but condition (2) is not satisfied by G. Then G/Ge¢ is of order 2; 
the integer g corresponding to the element g in G satisfies g=—1+2+-! 
modulo m; and the element g* =g? whose order is 1 or 2 belongs to G2; and 
2 <u, since g is not in Ge. 

Case 4.1. g* =1. Consider the group B of all the C-characters of G which 
map g upon an element in E*. 

Then it is readily verified that there exists to every element not in {g} 
a C-character in B which does not map it upon 1. If v generates Z, then the 
C-character v?-® js in B, since v2-9) =y*; and g is not mapped upon 1 by 
this C-character, since v‘+1. But B does not contain the C-character v'~?, 
that is, B is a complete group of C-characters which is not the full C-charac- 
ter group. 

Case 4.2. g* #1. Then g* is of order 2 and there exists an element w in Gc 
such that w?=g*. There exists, because of the C-completeness of G, a C-char- 
acter f of G which maps g* upon an element of order 2; and we denote by B 
the group of C-characters of G which is generated by f and by those C-charac- 
ters of G which map g and w upon 1. 

Since every Co-character of Gc which maps w (and therefore g*) upon 1 
is induced by one and only one C-character in B which maps g upon 1, it 
follows that there exists to every element y in G which is not in the subgroup 
W generated by g and w a C-character in B which does not map y upon 1. 

Since f(g*) is of order 2, and since f(g*) =f(g?) =f(g)!** =f(g)”” “, it follows 
that f(g) is of order m. Since the order 4 of w and f(w) is smaller than m, and 
since every element in W, not in Ge, is of the form gw’, it follows that 
S(gw*) =f(g)f(w)'¥1, that the elements, not in Wc, are mapped upon ele- 
ments, not 1, by C-characters in B. Since W¢ is generated by w, and since 
f(w*) =f(g*) ¥1, we see finally that B is a complete group of C-characters of G. 

From the argument used in the second paragraph of this Case 4.2 and 
from the fact that the order of the C-character f is m, since f(g) is of order m, 
it follows that the order of B is exactly 2“-* times the order of Gc. But it 
follows from Theorem II.5.2 that the order of the full C-character group of G 
is exactly 2*—-! times the order of Ge so that B is not the full C-character 
group of G. 

This completes the proof of the necessity of condition (2); and we have 
completed the proof of our theorem. 
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IV.3. The case of primary E and non-cyclic G/Gc. Since a primary cyclic 
group whose group of automorphisms is not cyclic is of necessity of order a 
power of 2 [at least 8], we shall impose throughout this section the following 
hypotheses(*). 

(IV.3.*) E is a cyclic group of order m =2* where 2<y, C is a homomor- 
phism of the group G into the group of automorphisms of E such that G/Ge 
is not cyclic, and G is C-complete. 

It will be convenient to derive first some general properties of such groups 
and their C-characters. 


Lemma IV.3.1. (a) There exists a basis g, 2 of G modulo Ge such that z in- 
duces an inversion in E and such that the integer corresponding to the element g 
under C is of the form g=—1+2'go modulo m where go is odd and 1<j <u. 

(b) If g, 2 is a basis of G modulo Gc, meeting the requirements of (a), if 
n=2" ts the order of the automorphism g of E (and therefore half the order 
of G/Gc), if g*=g" and G* the group generated by g and 2, then z*=g**=1, 
G=GcG*, Gd is generated by g* and the commutator |g, z] whose order 2" is a 
divisor of 

(c) If y, 2 are elements in G such that 2 induces the inversion in E, ana if f 
is a C-character of the group generated by y and 2, then f(|z, y])¥=f(z)'-*f(y)-?; 
and the C-character f* belongs to the principal genus of the group generated by y 
and z if, and only if, f([z,y]) =1. 


It should be noted that the commutator of two elements 2, w is defined by 
[v, w | 

Proof. It is an immediate consequence of our hypotheses (IV.3.*) and of 
II.1.B that there exists a basis y, z of G modulo G¢ such that z is an inversion 
of E; and it is readily verified that either the basis y, z or the basis yz, z of 
G modulo Ge meets the requirements of (a). 

If the basis g, z of G modulo Gc meets the requirements enunciated in (a), 
then the automorphism g of E leaves only the elements of an order dividing 2 
invariant; and it follows from the C-completeness of G and Theorem III.1.2 
that z?=g** =1. Since the group of automorphisms of E which are induced by 
elements in G is the direct product of a cyclic group of order 2 by a cyclic 
group of order n, and since this group is isomorphic to G*/G¢, it follows that 
Gé is generated by g* and the commutator [g, z]. This completes the proof of 
(b) with the exception of the assertion concerning the order of [g, z]. 

Suppose now that the element z in G induces the inversion in E, that y 
is some element in G, that Y is the subgroup generated by y and z, and that f 
is some C-character of Y. Then 


(*) Without restating them in the hypotheses of the various theorems of this section. 
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f(y)? = = fer) f(y) 
= = f[(y2)2y] = fley zy); 


and substituting y~' for y we obtain: 


= f(2)* f(y). 


Since y is odd and 1—y is even, it follows from this formula that 
f([z, =f([z, and we infer from the C-completeness of G 
that [z, y]*”’=1, completing the proof of (b). 

If f?(x) =v'-* for v in E and x in Y, then we infer from the above formula 
and the evenness of 1—y that f([z, y])¥=00-9G-»2"'-G-» =1 so that 
f({z, y]) =1; and if conversely f([z, y])=1, then it follows from the above 
formula that f(y)? =f(z)'~*"; and it follows from Theorem II.3.1(2) that f? be- 
longs to the principal genus of Y. 


Lemma IV.3.2. If g, 2 is a basis of G modulo Gc, and if 2 induces the inver- 
ston in E, then the commutator subgroup G’ of G is obtained by adjoining the 
commutator [g, 2] to G2. 


Proof. If x is in Gc and y in G, then the commutator x'!-v= [x, y-"] is 
in Gg, since 1—¥y is even; and in particular x'~*=x?. Now it is clear how to 
complete the proof, since subgroups of G¢ are by Theorem I.3.1 and Corollary 
I.3.2 normal subgroups of G. 


Lemma IV.3.3. If G’=G?, then there exists a strictly complete group of 
C-characters of G which is not the full C-character group of G. 


Proof. Suppose that the elements g, z in G form a basis of G modulo Gg 
which meets the requirements of Lemma IV.3.1(a). It is a consequence of our 
hypcothesis that the commutator [g, z] belongs to GZ and that there exists 
therefore an element w in Ge, satisfying (w?'O-”“e)?=[z, g], since 
g~(1—27-*go) is odd. It is clear that the elements g, wz form a basis of 
G modulo Ge which meets the requirements of Lemma IV.3.1(a); and one 
finds that 


[g, we) = = = = 1. 


Hence we may assume without loss of generality that the basis g, z of 
G modulo Ge satisfies beyond the conditions of Lemma IV.3.1(a) the addi- 
tional condition [g, z]=1 so that the subgroup G* of G which is generated 
by g and z is an abelian subgroup. It is a consequence of Lemma IV.3.1(b) 
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that G¢* is generated by the element g* =g* for m the order of the automor- 
phism g of EZ, and that g*?=1. 

Case 1. g*=1. Then we consider the group B of C-characters of G which 
is generated by the principal genus of G and by all those C-characters of G 
which map G* upon 1. Since B contains the principal genus, elements not in Gc 
are not mapped upon 1 by all the C-characters in B. Since 1 is the cross-cut 
of G* and Ge, it follows from Lemma I.3.4 that every Co-character of Gc in E 
is induced by one and only one C-character of G which maps G* upon 1 
and which therefore belongs to B; and thus it follows from the C-complete- 
ness of G and Theorem I.1.4 that 1 is the only element in Ge (and therefore 
in G) which is mapped upon 1 by all the C-characters in B. Thus B is shown 
to be strictly complete. Since B is the direct product of the principal genus 
and of the group of C-characters of G which map G* upon 1, and since the 
order of the latter group is by Theorem I.1.3 and the C-completeness of G 
(and by a previous remark) exactly the order of Gc, it follows from Theorem 
II.2.2(a) that the order of B is 2*—! times the order of Gc. On the other hand 
we infer from Theorem II.5.2 that the order of the full C-character group of G 
is exactly 2" times the order of Ge showing that B cannot be the full C-char- 
acter group of G. 

Case 2. g*#1. The cross-cut Gc* of G* and Ge is in this case by Lemma 
IV.3.1(b) the cyclic group of order 2 which is generated by g*. 

Denote by B* the group of all C-characters of G which map G* upon 1. 
It is an immediate consequence of the C-completeness of G, of Lemma I.3.4 
and Theorem I.1.4 that every Co-character of Gc in E which maps g* upon 1 
is induced by one and only one C-character in B*; and it follows from Theo- 
rem I.1.3 that the order of B* is half the order of Gc; and that the elements 
not in G* are mapped upon elements not 1 by characters in B*. 

Denote by B** the group of C-characters of G which is generated by ad- 
joining the principal genus of G to B*. The cross-cut of B* and the principal 
genus of G is evidently 1. Thus the order of B** is 2*-? times the order of Ge, 
since the order of the principal genus is by Theorem II.2.2(a) exactly 2-'. 
Clearly the elements, not in Gc*, are mapped upon elements different from 1 
by suitable C-characters in B**. 

Since G is C-complete, there exists a C-character f of G such that f(g*) ¥1. 
This C-character f does not belong to B**, since the characters in B* as well as 
in the principal genus map g* upon 1. From [g, z] =1 we infer f([g, z]) =1; and 
hence it follows from Lemma IV.3.1(c) that f? induces in G* a character of 
the principal genus of G*. There exists therefore an element v in EZ such that 
f?(x) =v'-* for x in G* so that the C-character b(x) =f?(x)v*—' belongs to B*. 
Hence f? itself belongs to B**; that is, f is of order 2 modulo B**. We denote 
by B the group of C-characters of G which is generated by adjoining f to B**. 
It follows from what we have shown thus far that B is a strictly complete 
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group of C-characters of G; and that the order of B is 2—' times the order 
of Ge, since the order of B is twice the order of B**. But we infer again from 
Theorem II.5.2(a) that the order of the C-character group of G is exactly 2* 
times the order of Ge showing that the strictly complete group B of C-char- 
acters of G is not the full C-character group of G; and this completes the proof. 

It is a consequence of Lemma I'V.3.1(a) that G/Ge¢ is the direct product 
of a cyclic group Z and a cyclic group Z’ where the elements in Z induce 
in E the inversion (so that Z is of order 2) whereas Z’ is generated by an ele- 
ment to which corresponds under C an integer of the form — 1+2‘go modulo m 
where go is odd and 1<j<uy. It is readily seen that Z and Z’ are uniquely de- 
termined by these properties. The normal subgroup A =A(G, C) of G which 
contains Gc and satisfies Z’=A/Gc is consequently uniquely determined by 
G and C. 


Lemma IV.3.4. If the basis g, z of G modulo Gc meets the requirements of 
Lemma IV.3.1(a), if m is the order of the automorphism g of E and g*=g", 
then A is obtained by adjoining g to Gc; and the cross-cut of A* and Ge is gen- 
erated by g* and Go. 


Proof. It is clear that the coset Gc, is a generator of the cyclic group 
Z'=A/G¢; and that therefore A is obtained by adjoining g to Gc. The ele- 
ments in A whose squares are in Ge are clearly contained in the cosets Gc 
and Gcg** ' of A/Gc, as these make up the cyclic subgroup of order 2 of 2’. 
The cross-cut of the groups A? and Ge therefore certainly contains the ele- 


ments in G? and the element (g**"*)?=g*. Any further element is of the form 
(xg”?')? =x1+e-"g* for x in Gc; and this element is contained in the group 
generated by adjoining g* to G?, since the exponent of x is certainly an even 
integer; and this completes the proof. 


Lemma IV.3.5. If the order of G/Gc is not 4 (and therefore divisible by 8), 
af the commutator subgroup G’ S A®*, then there exists a complete group of C-char- 
acters of G which is not the full C-character group of G. 


Proof. This is an immediate consequence of Lemma IV.3.3, in case 
G’ = G2; and hence we may assume that G2<G’ S$ A*. Since G’ $Ge, and since 
the order of G’/G? as well as the order of(2”) (Gc/\A?)/G? is at most 2 (as 
follows from Lemmas IV.3.2 and IV.3.4), it follows that G2<G’=Gc/\A?. 
If g, z is some basis of G modulo Ge which meets the requirements of Lemma 
IV.3.1(a), then it follows from Lemmas IV.3.2 and IV.3.4 that [g, z]=g* 
modulo GZ where g*=g* for m the order of the automorphism g of E; and 
that neither [g, z] nor g* is in G3 (so that g* is, by Lemma IV.3.1(b), of order 
2). Since the integer (1—g)g-' is of the form (1—g)g-!=2(1—2*-'go)g—! 


(7) S(\T denotes the cross-cut of the sets S and T. 
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modulo m where the factor of 2 is odd, there exists an element w in Ge 
such that g* = [g, wz]. Since the elements g, wz form, exactly 
as g, 2, a basis of G/Ge which meets the requirements of Lemma IV.3.1(a), 
we may assume without loss of generality that g, z is a basis of G modulo Gz 
rage meets the requirements of Lemma IV.3.1(a) and which satisfies 
[g, 2] 

We note that the order of the automorphism g of E is a multiple of 4 
(or 1<r), since the order of G/Gc¢ is a multiple of 8. 

Since g* is an element of order 2 in Ge which is not a square in Gc there 
exists a Co-character f; of Gc in E which maps GZ, but not g* upon 1 (ep. 
Theorem 1.1.4!); and it is clear that ff=1. Since G is C-complete, we infer 
from Theorem I.3.1 the existence of a C-character f of G which induces f; 
in Gc. Since m is the order of the automorphism g of E, and since g* and 
f(g*) are elements of order 2, we infer from II.'.B that 


S(g*) = fg") = = 


and this shows that f(g) is an element of order 2“, so that f(g) generates E. 
From Lemma IV.3.1(c) we infer that 


fe)? = = fle)" = 


showing that f(z)? is of order 2—' and f(z) of order 2+. 

The element g** ‘ induces in E an automorphism of order 2 and the integer 
corresponding to g** ' under C is of the form 1+2—' modulo m, since n2- 
is even. Hence we find that 


since f(g*) is of order 2 and since 1+2*-? is odd. 
The C-character f? of G maps Ge upon 1. If wis an element in G, not in Ge, 


whose order modulo Ge is two, then 


w= or g™ or 's modulo Ge. 


Accordingly we find that f(w)?=f(z)? or f(g** or ')~*f(z)*. In the first 
of these cases f(w)* is of order 2*~'; in the second case f(w)* is of order 2; 
and in the third case f(w)? is of order 2*—', since it is the product of an element 
of order 2 by an element of order 2—'. In all three cases f(w)? 1; and this 
shows that f? maps an element upon 1 if, and only if, this element is in Ge. 

Consider now the group JB; of all the C-characters of G which map g and z 
upon 1. If G* is the subgroup, generated by g and 2, then the cross-cut G¢ 
of G* and Ge is, by Lemma IV.3.1(b), the cyclic subgroup of order 2 gener- 
ated by g*. Hence it follows from the C-completeness of G and Lemma I.3.4 
that every Co-character of Gc in E which maps g* upon 1 is induced by one 
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and only one C-character in B,. Thus the order of B, is half the order of Ge; 
and there exists to every element in Ge, not in GZ, a C-character in B, which 
does not map it upon 1. 

Denote by B the group of C-characters of G, generated by f and B;. The 
order of B is 2“ times the order of B,, since the order of f(z) is 2“, that is, the 
order of B is 2*—! times the order of Ge. The group B is complete, since ele- 
ments not in Gc are mapped upon elements, not 1, by f, since the elements in 
Ge, not in Gc*, are mapped by C-characters in B, upon elements different 
from 1, and since f(g*) #1. But it follows from Theorem II.5.2 that the order 
of the full C-character group of G is 2“ times the order of Gc, showing that the 
complete group B of C-characters of G is not the full C-character group of G. 


Lemma IV.3.6. If G/Ge és of order 4, if A?=G?, then there exists a complete 
group of C-characters of G which is not the full C-character group of G. 


Note that G? SGe, since G/Ge¢ is the direct product of two cyclic groups of 
order 2. 

Proof. It is a consequence of Corollary III.1.3 that every C-character of 
the subgroup A of G is induced by some C-characters of G, since G is C-com- 
plete. It is a consequence of A?=G? and of Theorem IV.2.2(a) that there 
exists a complete group D of C-characters of A which is not the group of 
all the C-characters of A. Denote by B the group of all the C-characters of G 
which induce in A C-characters in D. Since not every C-character of A is 
induced by C-characters in B, and since every C-character of A is induced by 
some C-character of G, it follows that B is not the full C-character group of G. 

There exists a C-character f of G which maps A upon 1 and the elements, 
not in A, upon the element of order 2 in E, since G/A is of order 2, and since 
the automorphisms of E leave the element of order 2 invariant. Clearly f 
belongs to B; and thus it follows that B is complete, since B induces in A the 
complete group D. 


THEOREM IV.3.7. The C-character group of G is the only complete group of 
C-characters of G if, and only if, 

(i) G’ és not part of A* in case the order of G/Gc is a multiple of 8; and 

(ii) neither G’ nor A? is equal to G2 in case G/Gc is of order 4. 


Proof. The necessity of condition (i) is a restatement of Lemma IV.3.5 
whereas the necessity of (ii) may be derived from Lemmas IV.3.3 and IV.3.6. 

In the proof of the sufficiency of the conditions (i), (ii) we distinguish two 
cases: 
A. G’>A?*. We use a basis g, z of G modulo Ge which meets the require- 
ments of Lemma IV.3.1(a), denote by m the order of the automorphism g 
and put g*=g*. By G* we denote the subgroup generated by g and z and 
by A* the subgroup generated by g and [g, z]. Then it follows from Lemma 
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IV.3.1 that Got =A¢* is generated by g* and [g, z] where g*?=1 and where 
the order of the commutator [g, z] shall be denoted by 2*. It is a consequence 
of Lemmas IV.3.2 and IV.3.4 that the hypothesis A is equivalent to the two 
statements: 

A’. [g, 2]#g* modulo G2 and [g, z]#1 modulo G3. 

Suppose now that B be ‘a complete group of C-characters of G. Then B 
induces both in A and in A* a complete group of C-characters; and it is a 
consequence of condition (ii) and Theorem IV.2.2(2) that: 

(*) Every C-character of A, A* and of the cyclic group { gh generated by g 
is induced by some C-character in B. 

It is a consequence of Theorem II.5.2 that the order of the [cyclic] C-char- 
acter group of(?*) {g} is 2, if g*~1, and 2*-', if g*=1. Hence there exists a 
C-character }; in B such that 0:(g) is of order 2+, if g*¥1, and of order 2*—, 
if g*=1. If f is any C-character of G, then there exists an integer 7 such that 
bi(g)‘=f(g). Consequently there exists a C-character in B such that 
bo(g) =1 and such that the order of b2([g, z]) is 2*, if the cross-cut of the 
cyclic groups {g} and {[g, z]} is 1, and such that the order of }.([g, z]) 
is 2'-1, if g*1 is a power of [g, z]. It is a consequence of A’ that 1<h, if 
g*1 is a power of [g, z] so that dy is different from 1 on A*. These two 
C-characters b; and b, are independent on A*; and the order of the group of 
C-characters of A* induced by d:, b: and their products is exactly 2*~' times 
the order of A% so that every C-character of A* is induced by a combination 
of & and dy. 

It is a consequence of Lemma IV.3.1(c) that 


be([z, g]) = = 


and that therefore the order of b2(z) is twice the order of 2([z, g]). The order 
of the C-character of G*, induced by be, is therefore twice the order of the 
C-character of A*, induced by be, since the latter is the same as the order of 
bo([z, g]). The group of C-characters of G*, induced by the combinations of 
b, and be, has therefore the order 2* times the order of Gc* (note that 
G¢* =A); and thus it follows from Theorem [1.5.2 that 

(**) every C-character of G* is induced by some combination of the C-charac- 
ters b, and bz in B. 

Suppose now that W, is a coset of order 2 in Gc/Gc* and that W is the 
subgroup of G, generated by adjoining Wo to A*. It is a consequence of hy- 
pothesis A’ that Wo contains an element w satisfying w*=1 or w?=g* (the 
latter can happen only if the order m of the automorphism g of E£ is not 2). 
Since 5,(g*) =1, it follows that b.(w) is an element of order 1 or 2 showing 
that b, induces C-characters of equal order in W and in A*. The order of 


(?*) It should be remembered that { g} denotes the cyclic group, generated by the element g. 
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b,(w) is a divisor of 4 and 4 is a divisor of 2—' and therefore of the order of }y, 
showing that }; induces C-characters of equal order in W and in A*. Thus 
it follows that the group D of C-characters, generated by }; and de, induces 
in W a group of C-characters whose order is 2*—' times the order of A@. 
Since w is not in A%, and is of order 2 modulo AZ, it follows from Theorem 
II.5.2 that the C-character group of W is of order 2* times the order of AZ; and 
hence there exists a C-character r of W which is not induced by C-characters 
in D. But WS4A; and hence it follows from (ii) and Theorem IV.2.2(2) that 
every C-character of W is induced by some C-character in B. In particular 
there exists a C-character } in B which induces r in W. We infer from (**) 
the existence of integers i’, i’’ such that b and b{d{' coincide on G*; and we 
put b;=bb;"br*”’. This C-character b; certainly belongs to the group B* of 
all the C-characters in B which map G* upon 1. Since b; cannot map every 
element in W upon 1, and since 6s maps every element in A* upon 1 (as A* 
is part of G*), it follows that b;(w) #1; and this implies b3(Wo) #1. Hence 
there exists to every element in Gc whose order modulo G* (or Gi=A?) 
is 2 a C-character in B* which does not map it upon 1. Now we infer from (**) 
and Lemma IV.1.3 that B is the full C-character group of G, as was to be 
shown. 

B. G’ SA’. In this case we infer from (i) that the order of G/Ge is 4. Using 
condition (ii) we infer now—exactly as in the beginning of the proof of Lemma 
IV.3.5—the existence of a basis g, z of G modulo Gc meeting the requirements 
of Lemma IV.3.1(a) and satisfying: 

B’. g?=g* = [g, z] is an element of order 2, not in G2. We note that the 
integer corresponding under C to the element g in G is of the form g=—1 
+2+-! modulo m; and we verify that 


2] = = = z]e* = [g, 2]. 


Suppse now that B is a complete group of C-characters of G. Then there 
exists in B a C-character such that b(g*)#1. Hence 6(g*) =b(g*) =b(g)!+¢ 
=b(g)** is an element of order 2; and 5(g) is an element of order 2“. From 
Lemma IV.3.1(c) we derive b(g*)b(g)? = b(g*) °b(g)? = =b(z)2+2"". Since 
b(g)? is of order 2*—! and 6(g*) of order 2, we deduce from this last equation 
that b(z) is of order 2* so that b(z)*"™’ is the only element of order 2 in E, 
namely b(g*); and hence it follows that b(g)?=b(z)?. 

If b(g) =b(z), then b(gz) = b(g)—b(z) =1. If b(g) ¥b(z), then it follows from 
b(g)* =6(z)? that b(z) =b(g)b(g*) =b(gg*) =b(g-") since b(g*) is the element of 
order 2 in E; and this shows that b(g~!z) = b(g—')—1b(z) =1. In both cases there 
exists therefore in the group G*, generated by g and z, an element which is 
not contained in Ge and which is mapped upon 1 by the C-character b. Since 
B induces a complete group of C-characters in G*, there exists a C-character b’ 
in B which does not coincide with any power of b on G*. Denote by D the 
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subgroup of B, generated by b and b’. Since 6 is of order 2* on G*, it follows 
that D induces in G* a group of C-characters whose order is at least 2**?, 
But this is by Theorem II.5.2 exactly the order of the C-character group of G*, 
since G¢ is by Lemma IV.3.1(2) and condition B’ the cyclic group of order 2, 
generated by g*. Thus we have shown: 

(+) Every C-character of G* is induced by some C-character in DSB. 

Since b(g) is an element of order 2“, there exists an integer i such that 
b(g)*=b’(g). Put bo) =b’b-*. Then b, bo generate D and dy is different from 1 
on G*, though bo(g) = 1. Since this implies bo([c, z]) =bo(g*) =bo(g?) =1, it fol- 
lows from Lemma IV.3.1(c) that do(z) is an element of order 2. 

We denote again by B* the group of all those C-characters in B which 
map G* upon 1. 

If Wo is a coset of Gc/Gé whose order is 2, then it follows from (ii) and 
condition B’ that Wp» contains an element w of order 2 which is not in G¢*, 
that is w#g*. The complete group B of C-characters of G induces in Gc a 
complete group of Co-characters; and it follows from Theorem I.1.4 that every 
Co-character of Gc in E is induced by some C-character in B. Hence it follows 
from Lemma I.1.1(b) that B contains a C-character which maps w, but not g*, 
upon 1; and we may assume without loss of generality that the C-character ), 
considered before, has this property. 

If bo(w) =1, then let f be any C-character in B such that f(w) #1 (the 
existence of f is a consequence of w~1 and of the completeness of B). There 
exist integers 1, i9 such that f and 5b‘bj* coincide on G*, as follows from (+); 
and hence f* =fb-‘be** is in B*. But f*(w) =f(w) #1 so that B* contains a 
C-character which does not map W, upon 1. 

Suppose next that bo(w) #1. Then bo(w) is of order 2, that is bo(w) =bo(z). 
It has been shown before that either b(gz) or b(g~'z) is equal to 1. Hence 
determine ¢= +1 in such a fashion that b(g*s)=1. Then b(wg*z)=1 and 
bo(wg*z) = bo(w)bo(g*z) = bo(w)bo(z) =1, since do(g)=1, and since bo(w) =do(z) 
is of order 2. From the completeness of B we infer the existence of a C-char- 
acter t in B such that ¢(wg*z) #1; and from (+) we deduce the existence of 
integers k, ko such that (*=ib-*by** is in B* (maps G* upon 1). Hence 
t*(Wo) =t*(w) =t*(w) “*t*(g*z), since ¢*(w) is of order 2 and é*(g*z) is equal 
to 1; and therefore we find that ¢*(Wo) =é*(wg*z) =t(wg*z) ¥1, since b(wg*s) 
= bo(wg*z) = 1; and we have shown again the existence of a C-character in B* 
which does not map W» upon 1. 

But now it is an immediate inference from (+) and Lemma IV.1.3 that 
B is the full C-character group of G; and this completes the proof of the theo- 
rem. 


THEOREM IV.3.8. The C-character group of G is the only strictly complete 
group of C-characters of G if, and only if, G2<G’. 
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Proof. It is a consequence of Lemma IV.3.2 that G2 $G’. Hence we may 
deduce the necessity of our condition from Lemma IV.3.3. 

Assume now that G2<G’. We distinguish two cases: 

Case 1. The order of G/Gc is 4. If A?#G?, then our contention is a con- 
sequence of Theorem IV.3.7. Thus we assume now that A?=G?. Consider a 
basis g, z of G modulo Ge meeting the requirements of Lemma IV.3.1(a). 
Then g* = g? is in G3. 

If B is a strictly complete group of C-characters of G, then B induces in A 
a strictly complete group of C-characters; and it follows from Theorem IV.2.1 
that B induces in A every C-character of A. It is a consequence of the C-com- 
pleteness of G and A and of Lemma I.3.4 that there exists a C-character of A 
which maps g and G? upon 1 but which maps [g, z] upon an element of order 
2, since g* is in G3 whereas [g, 2] does not belong to G3; and hence there exists 
in B a C-character 6 which maps both g and G? upon 1 though 0([g, z]) is of 
order 2. Hence it follows from Lemma IV.3.1(c) that b(z)!-* =6([(z, g])*; and 
this implies b(z)?=0([z, g]), since b([z, g]) is an element of order 2. The 
C-character b? of G has therefore the properties b?(A) =1, 5*(z) is of order 2. 
Hence it follows that every C-character of A is induced by at least two C-char- 
acters in B. The order of the C-character group of A is by Theorem II.5.2 
exactly 2“—! times the order of Gc; and the order of the C-character group of G 
is exactly 2* times the order of Gc; and from these facts one deduces that B 
is the full C-character group of G. 

Case 2. The order of G/Ge its divisible by 8. If G’ is not part of A*, then our 
contention is a consequence of Theorem IV.3.7. Thus we assume now that 
Gt<G’sA?*; and we prove—exactly as in the beginning of the proof of 
Lemma IV.3.5—the existence of a basis g, z of G modulo Ge which meets the 
requirements of Lemma IV.3.1(a) and which satisfies g* = g* = [g, z] is an ele- 
ment of order 2 (m denotes, as always, the order of the automorphism g of E). 

If B is a strictly complete group of C-characters of G, then B induces in A a 
strictly complete group of C-characters; and it follows from Theorem IV.2.1 
that every C-character of A is induced by some C-character in B. 

There exists a Co-character of Gc in E which maps G? upon 1 and g* upon 
an element of order 2, since g* = [g, z] is not in G3; and it follows from the 
C-completeness of G and A and from Theorem I.3.1 that this Co-character 
of Gc is induced by some C-character of A. Hence there exists a C-character 
b in B such that b(GZ)=1 and 6(g*) is the element of order 2 in E. From 
b(g*) =b(g") =b(g)*+9+- and from II.1.B we infer b(g*) is an 
element of order 2? so that b(g) is an element of order 2*. Clearly 6?(Gc) =1. 
But from 6([g, z]) #1 and Lemma IV.3.1(c) we infer that 5? does not belong 
to the principal genus. 

From our choice of g it follows that the integer corresponding to g under 
C is of the form g=—1-+2%go modulo m for go an odd integer and 1<j <u. 
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The integer 1—g is therefore divisible by 2, but not by 4; and hence there 
exists an element v in E such that b(g)?=v'-*. The C-character v'~* of G be- 
longs to B; and B contains therefore the C-character d(x) =b(x)*v*-'. This 
C-character d is not 1, since 6? has been shown not to be in the principal 
genus; but d maps both g and Gc upon 1; that is, the C-character d¥1 in B 
maps A upon 1. Now it follows that every C-character of A is induced by at 
least two C-characters in B. We deduce from Theorem II.5.2 that the order 
of the C-character group of G is exactly twice the order of the C-character 
group of A, showing that B is the full C-character group of G; and this com- 
pletes the proof. 

IV.4. Completeness and p-completeness of groups of C-characters. We 
have treated the problem of complete groups of C-characters completely for 
the special case of primary EZ. The reduction of the general case where E is 
of composite order to this special case will now be accomplished by means of 
the following considerations. 

The group B of C-characters of the group G is the direct product of its 
subgroups B, which consist of all the elements of order a power of (the prime 
number) ~, since B is abelian. A C-character of G is of order a power of if, 
and only if, it maps G into E(p) (using the notations (I1.3.*)!); and thus B, 
consists exactly of ali the Cp-characters in B. It is a consequence of the defini- 
tion of the subgroup H(p) that B,(H(p)) =1; and this leads to the following 
definition: 

The group B of C-characters of G is p-complete, if B,(x) =1 implies that x 
is in H(p). 

If the group B of C-characters of G is p-complete for every p, and if x is 
an element in G which is mapped upon 1 by B, then B,(x) =1 for every ; 
and consequently x belongs to the cross-cut H of all the H(p). If G is C-com- 
plete, and if B is p-complete for every p, then we deduce the completeness 
of B. The converse of this result need not be true; and it will be the main ob- 
ject of this section to find criteria for the validity of the converse. 


THEOREM IV.4.1. If G is C-complete, and if p is a prime, then the following 
condition is necessary and sufficient for the p-completeness of every strictly com- 
plate group of C-characters of G: 

(p) H(p)Ge contains every elemeni(**) in Gc, whose order modulo H(p) is 
exactly p. 

Proof. Assume first that condition (p) is not satisfied. Then there exists an 


element w in Ge, which is not contained in H(p)Gce, though its order 
modulo H(p) is p. We denote by B the group of C-characters of G which is 


(®) See Lemma III.2.2 and Corollary III.3.2 for properties of the subgroup H(p)Gc 
=H*(p). 
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generated by all the Cq-characters for g#p, and by all those Cp-characters 
which map w upon 1. This group B contains the principal genus, since w is 
in Gc,. Thus B(x) =1 implies that x is in Ge. If x1 is an element of order a 
power of q for g a prime different from , and if x is in Gc, then it follows from 
the C-completeness of G and from Theorem III.3.1 that x does not belong to 
H(q). It is a consequence of the definition of H(q) that G/H(qg) is Cg-com- 
plete; and hence there exists a Cg-character of G which does not map x upon 1. 
But all the Cg-characters are in B, that is B(x) #1. If x¥1 is an element of 
order a power of p in Ge, then it follows again from the C-completeness of G 
and from Theorem III.3.1 that x is not in H(p); and it follows from the fact 
that w is of order p modulo H(p), but does not belong to H(p)Gce, that H(p)w 
and H(p)x are independent elements of Gc,/H(p). Hence we deduce from 
Lemma I.1.1(b) the existence of a Co-character d of Gc,/H(p) in E(p) which 
maps H(p)w upon 1, but which does not map H(p)x upon 1. It follows from 
the Cp-completeness of G/H(p) and from Theorem I.3.1 that d is induced 
by a Cp-character of G/H(p); and d is therefore induced by a Cp-character of 
G. This Cp-character of G belongs to B, since it maps w upon 1; and thus we 
have shown that B(x) #1. From the facts derived thus far it follows that B 
is a strictly complete group of C-characters of G. But it follows from the 
definition of B that B,(w) =1, in spite of the fact that w is not in H(p), that 
is the strictly complete group B is not p-complete, showing the necessity of 
the condition (p). 

Assume conversely that (p) is satisfied, and that B is a strictly complete 
group of C-characters of G. We denote by V the subgroup of all the ele- 
ments in G which are mapped upon 1 by every Cp-character in B. Then 
H(p) S VSGep, since the principal genus is in B. Suppose now that w is an 
element in Gc, whose order modulo H(p) is p. Then it follows from condi- 
tion (p) that w is in H(p)Ge. It is a consequence of Theorem III.2.1(a) that 
H() contains all the elements in Gc whose orders are prime to p; and hence 
we may represent w in the form w=xy for x in H(p) and y an element of 
order a power of ~ in Ge; and it follows from the C-completeness of G that y 
is an element of order ~, since the order of xy modulo H(p) is just p, and since 
w=y modulo H(p). The group B is complete and contains therefore a C-char- 
acter b such that b(y) #1. Since b induces a Co-character in Gc, and since y 
is an element of order p in Ge, it follows that b(y) is of order p. The group B 
contains the C-character b:=b"» “” which maps every element in G upon 
an element in E(p); that is }; is a Cp-character and belongs to B,. But 
b,(w) =b,(y) is of order p and therefore different from 1, since the exponent 
mp~-™?) is prime to p. Thus B,(w) #1 and w does not belong to V. The sub- 
group V/H(p) of Gc,/H(p) consequently does not contain elements of order 
p; and this implies V= H(p), since Gc,/H(p) is of order a power of p. Hence B 
is p-complete, as was to be shown. 
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EXAMPLE. Every prime number ? is a divisor of g—1 for g a suitable 
prime(*°); and we denote by E the cyclic group of order p*g. There exists an 
automorphism g of the group E which induces both in E(p) and in E(g) an 
automorphism of order p. If G is the cyclic group of order p, generated by g, 
and if C is the isomorphism which maps the element g in G upon the auto- 
morphism g of EZ, then Ge=Gc,=Ge,=1 so that G is C-complete and so that 
condition (p) is satisfied. Denote by B the group of all the Cq-characters of G. 
Clearly B is a complete group of C-characters, since B is a complete group of 
Cq-characters. But B, = 1, showing that B is not p-complete. 

This shows that the hypothesis of strict completeness occurring in Theo- 
rem IV.4.1 cannot be omitted. 

REMARK. If the principal genus of G contains every C-character of G which 
maps Gc upon 1, then we infer H(p)Gc=Gc, from Theorem II.3.2 and this 
condition certainly implies condition (p) of Theorem IV.4.1. 


Coro.iary IV.4.2. The C-character group of G is the only strictly complete 
group of C-characters of G if, and only if, condition (p) of Theorem IV.4.1 is 
satisfied for every prime p, and H(2)Gt,<H(2)G’, in case C is singular. 


Proof. Suppose first that the C-character group of G is the only strictly 
complete group of C-characters of G. This implies the p-completeness of every 
strictly complete group of C-characters of G, proving the necessity of (p). 
It implies furthermore that the C2-character group of G/H(2) is the only 
strictly complete group of C2-characters of G. If C is singular, then we infer 
from Theorem IV.2.1 that G/Gez is not cyclic; and hence it follows from 
Theorem IV.3.8 that H(2)G2.<H(2)G’. 

Suppose conversely that the conditions are satisfied, and that B is a 
strictly complete group of C-characters of G. Then we infer from Theorem 
IV.4.1 that H(p) is the set of all the elements in G, mapped upon 1 by the 
group B, of the Cp-characters in B. If p is odd, or if C2 is not singular, then 
we infer from Theorem IV.2.1 that B, contains every Cp-character of G/H(p) 
and consequently of @. If C2 is singular, then G/G@ is not cyclic, since other- 
wise H(2)G¢,=H(2)G’. Thus we may infer from our hypothesis aid from The- 
orem IV.3.8 that B; contains every C2-character of G/H(2) and of G, showing 
that B is the full C-character group of G. 

In the following investigations we shall make use of the subgroups H(p). 
Each such group is defined as the cross-cut of all the subgroups H(q) for g¥p; 
and this is equivalent to saying H(p) is the set of all the elements x in G 
which are mapped upon elements in E(p) by every C-character of G. If G is 
C-complete, then the cross-cut of H(p) and H(p) is 1; and it follows from the 
fact that every C-character induces a Co-character in the abelian group Ge 
that the cross-cut of Ge and H(p) consists of all the elements in Gc whose 


As follows from Dirichlet’s theorem. 
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order is prime to p whereas the cross-cut of Gc and H(p) consists just of the 
elements of order a power of p in Ge. 


THEOREM IV.4.3. If G is C-complete, if p is a prime such that G/Gc» is 
cyclic(*"), then the following conditions are necessary and sufficient for p-com- 
pleteness of every complete group of C-characters of G: 

(p*) H(p)H(p) contains every element in G whose order modulo H(p) is 
exactly p. 

(p**) If the order of G/Gc, is prime to p, and if x is an element of order a 
prime, not p, in G/H(p), then the centralizer(**) of x in G/H(p) contains an 
element different from 1 in (H(p)H(p))/H(p). 


REMARK. The-condition (p) of Theorem IV.4.1 implies that elements of 
order p in Gc,/H(p) are in (H(p)H(p))/H(p), since Gc S$ H(p)H(p); and this 
condition (p) is a consequence of the condition (p*) above, since H(p) is part 
of the cross-cut of all the Ge, for g#p, and since Ge is exactly the cross-cut 
of all the Ge,. ; 

Proof. We precede our arguments by the proofs of some lemmas. 

(1) If the element x in G/H(p) is of order a prime, not p, then the centralizer 
of x in G/H(p) is a cyclic group T such that G/H(p) =T(Gc,/H(p)) and such 
that the cross-cut of T and Gc,/H(p) is 1. 

We note first that G/H(p) is Cp-complete. Hence Gc,/H(p) is, by Theo- 
rem III.1.2, a p-group which certainly does not contain the element x. It is 
a consequence of II.1.A that the automorphism of E(p) induced by x has no 
fixed elements except 1; and hence it follows from Theorem III.1.2 and the 
Cp-completeness of G/H(p) that x does not commute with any element in 
Gc,/H(p) except 1; that is, the cross-cut of T and Gc,/H(p) is 1. 

Since G/Ge» is cyclic, there exists an element g in G/H(p), generating 
G/H(p) modulo Ge,/H(p). The order of G/Ge, is divisible by the order r 
of x so that the order of the automorphism g of E(p) is divisible by the prime 
rp; and it follows from II.1.A that the integer g—1 is prime to p. A certain 
power g” of g induces in E(p) the same automorphism as x; and we may as- 
sume without loss of generality that g has been chosen in such a fashion that 
is a divisor of the order of the automorphism g of E(p), that is, that mr is the 
order of the automorphism g of E(p). Then x’=xg-* is an element in 
Gc,/H(p). Since g'-*, g—1 and g*—1=x—1 modulo p™” are all integers 
prime to ~, it follows that 1+g-'+ - - - --- 
=g'-"(g"—1)/(g—1) modulo p” is an integer prime to ~; and hence there 
exists an element s in Gc,/H(p) such that s!+¢"'+---+e"" =x’, But then we 
deduce from Theorem III.1.2 and the Cp-completeness of G/H(p) that 


(*) This hypothesis is automatically satisfied if p is odd. The case excluded by this hy- 


pothesis is treated in Theorem IV.4.4 below. 
() The centralizer of the element x in the group K consists of all the elements z in K such 


that sx =<xz. 
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(sg)* = sito = = xg-"g" = x. 


Clearly sg=¢ belongs to 7; and from g=¢ modulo Gce,/H(p) we infer that 
G/H(p) = T(Ge,/H(p)). The cyclicity of T is a consequence of the facts that 
the cross-cut of T and Gc,/H(p) is 1, and that G/Ge, is cyclic. This completes 
the proof of (1). ; 

(2) If x is an element of order p in G/H(p), but not in Gc,/H(p), then 
every Co-character of Gcp/H(p) is induced by some Cp-character of G/H(p) 
which maps x upon 1. 

The order of G/Gc, is of the form pn; and because of the cyclicity of 
G/Ge, there exists an element g in G/H(p) such that g* and x induce the same 
automorphism in E(p) and such that g generates G/H(p) modulo Gc,/H(p). 
Clearly x’ =xg-* is an element in Gc,/H(p). Then we infer from the Cp-com- 
pleteness of G/H(p) and Corollary 1.3.2 that 


or gt=g"?=x'—(ite"+---+0""™) We note that g* generates the cross-cut 
{ghcp of Gep/H(p) and of the cyclic group {g}, generated by g. Since 
G/H(p) is Cp-complete, so is its subgroup {g}; and it follows from Theorem 
1.3.1 that every Co-character of { g } cp is induced by some Cp-character of { g } : 
Furthermore we have 


= g” modulo pr), 


since g”?=1 modulo p™), 

Since the C-character group of a cyclic group is by Lemma III.1.1 itself 
a cyclic group, and since {g} is Cp-complete, there exists a Cp-character d 
of {g} with the property d(g*) =1 implies g‘=1. Since x’ is in Gcp/H(p), and 
since Cp-characters are Co-characters in Gc,/H(p), we find 


Remembering that mp is the order of the automorphism g of E(p) one readily 

deduces from the last identity that the order of x’ is a factor of the order 

of d(g"). 

. If now f is any Co-character of Gc,/H(p) in E(p), then there exists an 
integer ¢ such that d(g*)'=f(x’—®), since the order of f(x’-”) is a divisor of 

the order of x’; and we verify that 


d(g*)' d(g")! = d(g”) 


= f(g*). 
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Hence we infer from Lemma I.3.4 the existence of a Cp-character f; of G/H(p) 
which induces d‘ in {g} and f in Ge,/H(p); and this character f; satisfies 
=falxe’g”) =f(x’) f(x’) =1, as was to be 
shown. 

Suppose now that every complete group of C-characters of G is p-com- 
plete. Then we denote by B, the group of C-characters which is generated 
by all the Cq-characters for g¥ p. 

Assume first that the element x in G/H(p) is of order p. If x were in 
Gc,/H(p), then we inferred from Theorem IV.4.1(p) that x is in GcH(p)/H(p) 
<(H(p)H(p))/H(p). Thus we consider the case where x is not in Gc»/H(p). 
Then we denote by B, the group of all the Cp-characters of G (or G/H(p)) 
which map x upon 1; and we put B=B, B,. This group is certainly not p-com- 
plete, as B, maps upon 1 all the elements in the subgroup {x} of order p of 
G/H(p). If y is an element in G/H(p), but not in the subgroup generated by x 
and Gc,/H(p), then there exists(**) a Cp-character in the principal genus of 
G/H(p) which maps x, but not y, upon 1; and if y belongs to the subgroup 
generated by x and Gc,/H(p), though y does not belong to {x}, then we infer 
from (2) the existence of a Cp-character in B, which does not map y upon 1. 
This shows that {x} is the subgroup of those elements in G/H(p) which are 
mapped upon 1 by every Cp-character in B,. It is a consequence of the defini- 
tion of B; and H(p) that H(p) is the set of all the elements in G mapped 
upon 1 by all the C-characters in By ; and hence it follows that the set of 
elements mapped upon 1 by all the C-characters in B is just the cross-cut X* 
of H(p) and the subgroup X of G which contains H(p) and satisfies 
{x} =X/H(p). Since B is not p-complete, it follows from our present hy- 
pothesis that X* 1. Since G is C-complete, it follows that the cross-cut of 
H(p) and H(p) is 1; and since X/H(p) is of order p, we deduce that 
X <H(p)H(p), showing the necessity of condition (p*). 

To show the necessity of condition (p**) we consider an element x in 
G/H(p) whose order is a prime r#p. Then we infer from (1) the existence 
of an element g in G/H(p) such that g generates G/H(p) modulo Gc,/H(p) 
and such that g*=x where mr is the order of the automorphism g of E(p). 
We consider the group B, of all the Cp-characters of G or G/H(p) which 
map g upon 1. It is a consequence of (1) and of Lemma I.3.4 that every 
Co-character of Gc,/H(p) in E(p) is induced by one and only one Cp-char- 
acter in B,; and this shows that the group { g} generated by g is the subgroup 
of all the elements in G/H(p) which are mapped upon 1 by every Cp-character 
in B,. Denote by B the product(*) of B/ and B,. Then the group of elements 
mapped upon 1 by all the C-characters in B is the cross-cut X* of H(p) and 


(*®) Take an element v of order p in E(p). Then v'~* =1, though v'-" #1, since the order 
of y modulo Gey is neither 1 nor p, as G/Gcz is cyclic. 

(*) Where BJ denotes, as before, the group generated by all the Cq-characters for g any 
prime not p, that is BJ consists of all the C-characters of order prime to p. 
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of the subgroup X of G which contains H(p) and satisfies X/H(p) = {g}. 
Since B is not p-complete, it is not complete, showing that X* 1; and not- 
ing that { g} is exactly the centralizer of x in G/H(p) (cp. (1)!), we deduce the 
necessity of conditon (p**). 

Assume now that conditions (p*) and (p**) are both satisfied by G, C, 
and p, and that B=B, B, is a complete group of C-characters of G where B, 
consists of Cp-characters of G only, whereas B, is generated by Cq-characters 
of G with gp. This implies in particular that B, (H(p)) =1; and we deduce 
from the completeness of B that there exists to every element, not 1, in H(p) 
a Cp-character in B, which does not map it upon 1. 

Consider an element x in G whose order modulo H(p) is p. Then we infer 
from condition (p*) that x is in H(p)H(p); and one deduces from the remark 
made at the end of the last paragraph the existence of a Cp-character in B, 
which does not map x upon 1. 

Next we consider(*) an element x whose order modulo H(p) is a prime 
rp. Then we infer from (1) that the centralizer T of H(p)x in G/H(p) is a 
cyclic group, generated by an element g, such that the cross-cut of T and 
Ge,/H(p) is 1, and such that G/H(p) = T(Gc,/H(p)). If p is a divisor of the 
order of G/Gc», then p is a divisor of the order of T; and it follows from (p*) 
that the cross-cut of T and (H(p)H(p))/H(p) is different from 1. If p is not 
a divisor of the order of. G/Ge», then we infer from condition (p**) that the 
cross-cut of T and (H(p)H(p))/H() is different from 1. Denote by 7* the 
cross-cut of T and (H(p)H(p))/H(p). Then we have seen that 7* is a cyclic 
group different from 1; and there exists an element ¢ in H(p) such that H(p)é 
generates 7*. We note that 1. Since ¢ is in H(p), it follows that B, (#) =1; 
and we infer from the completeness of B the existence of a Cp-character 5 
in B, such that b(#) #1. It is a consequence of Theorem II.2.1 that every 
Cp-character of T belongs to the principal genus; and hence there exists an 
element v in E(p) such that b(y)=v'-* for y in T. From 13b(t)=v'-' we 
infer v¥*1; and since x —1 is by II.1.A prime to 9, it follows that d(x) =v'-* #1. 
Thus we have shown that no element of order a prime in G/H(p) is mapped 
upon 1 by every Cp-character in B (or B,); and from this fact one deduces 
the p-completeness of B. 

In Theorem IV.4.3 we discussed only those primes p for which G/Ge, is 
cyclic; and this is always the case if » is odd. Thus we have to discuss the 
situation in case G/Ge: is not cyclic. Then there exists a coset Zy of G/Ge: 
all of whose elements induce in E(2) the inversion. Apart from Zp» there are 
two further cosets of order 2 in G/Ge2; and it follows from I1.1.B that the 
integers corresponding under C2 to these two cosets are of the form 
+1+2"-! modulo 2. We denote by Ko the uniquely determined coset 
of order 2 of G/Gcz to whose elements there corresponds under C2 the in- 


(®) Since G/H(2) is a 2-group, the existence of such an element x implies that p is odd. 


| 
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teger 1+2"-! modulo 2”; and we denote by K = K(G, C) the subgroup 
of G whose elements are either in Gc: or in Ko. In case the order of G/Ge: is 
divisible by 8 it is possible to characterize K as the subgroup between Geo 
and G*G_ with the property that K/Ge: is of order 2. 


THEOREM IV.4.4. If G is C-complete, and if G/Ge2 is not cyclic, then the 
following conditions are necessary and sufficient for 2-completeness of every com- 
plete group of C-characters of G: 

(i) H(2)H(2) contains every element in the subgroup K whose order 
modulo H(2) is 2(*). 

(ii) If G/Ges is of order 4, then H(2)H(2) contains the following elements: 

(ii’) Every element x in G whose order modulo H(2) is 2 and which does not 
induce the inversion in E(2), and 

(ii’’) Every element z in G which induces the inversion in E(2) and which 
satisfies (zy)*=y?""" modulo H(2) for every y in K, y not in Ges. 


Proof. We start by proving some lemmas. 

(1) If S is a subgroup of K/H(2), then every C2-character of S is induced 
by some C2-character of G. 

It is obvious that the group of all the C2-characters of G induces in S 
a complete group of C2-characters. But the only complete group of C2-char- 
acters of SS K/H(2) is the group of all the C2-characters of S, as follows from 
Theorem IV.2.2, if S is not part of Ge2/H(2), and from Theorem I.1.4, if 
SSGe:/H(2). 

(2) If S is a subgroup of K/H(2), and if P(S) is the group of C2-characters 
of G, mapping S upon 1, then there exists to every element in K/H(2), not in S, 
a C2-character in P(S) which does not map it upon 1. 

If P2 is the group of all the C2-characters of G, then we infer from (1) 
that P:/P(S) is essentially the same as the group of all the C2-characters of S. 
It follows from Theorem II.5.2 that the orders of S and of its group P;/P(S) 
of C2-characters are equal. If S’’ is the group of elements in K/H(2) mapped 
upon 1 by all the C2-characters in P(S), then SSS’’ and P(S) = P(S’’) show- 
ing that S and S’’ are of equal order; and this implies the desired equality 
of S and S”’. 

Assume now that every complete group of C-characters of G is 2-complete. 
Consider an element x of order 2 in K/H(2); and denote by B, the group 
of all the C2-characters of G which map x upon 1, by Q the group of all the 
C-characters of odd order of G, and by B the product of Q and B,. The group 
of elements in G, mapped upon 1 by Q, is H(2); and the group of elements in G 
mapped upon 1 by Bz is, by (2), the group X, generated by the elements in 
the coset x of K/H(2). Since B is obviously not 2-complete, it follows that B 


(*) Thus H(2) H(2) contains every element in Gc2 whose order modulo H(2) is 2; and it 
may happen that these are the only elements in K whose orders modulo H(2) are 2. 
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is not complete; and this implies that the cross-cut of X and H(2) is different 
from 1. Since X/H(2) is of order 2, and since the cross-cut of H(2) and (2) 
is 1 as a consequence of the C-completeness of G, it follows now that 
X SH(2)H(2), proving the necessity of condition (i). 

Suppose now that the order of G/Gez is 4. If x is an element of order 2 in 
G/H(2) which does not induce the inversion in E(2), then it is a consequence 
of condition (i), already proven, that x is in (H(2)H(2))/H(2), provided x 
is in K/H(2). If x is not in K/H(2), then the integer corresponding to x under 
C2 is of the form x= —1+2™-! modulo 2”. We denote by B, the group 
of all the C2-characters of G which map x upon 1; and by W the group, gen- 
erated by adjoining x to Ge2/H(2). From x?=1 and Lemma I.3.4 we infer 
that every Co-character of Ge2/H(2) is induced by a C2-character of W which 
maps x upon 1; and it follows from Corollary III.1.3 and the fact that W 
is of index 2 in G/H(2) that every C2-character of W is induced by a C2-char- 
acter of G/H(2). Since B, contains the C2-character which maps W upon 1 
and the elements in G/H(2) which are not in W upon the element of order 
2 in E(2), it follows that only elements in W are mapped upon 1 by By. 
Since every Co-character of Ge2/H(2) is induced by a C2-character in Bz, 
and since G/H(2) is C2-complete, it follows that elements not equal to 1 in 
Ge2/H(2) are not mapped upon 1 by every C2-character in B;. Thus it follows 
that x and 1 are the only elements mapped upon 1 by every C2-character in 
B;. Denote now by Q the group of all the C-characters of odd order of G, and 
by B the product of Q and By. If we denote by X the subgroup consisting of 
the elements in H(2) and in the coset x, then the cross-cut of X and H(2) is the 
set of elements mapped upon 1 by B. Since B is not 2-complete, it follows 
that B is not complete and that therefore the cross-cut of X and H(2) is 
not 1. Since X/H(2) is of order 2, this implies X S$ H(2)H(2), that is, we have 
verified the necessity of condition (ii’). 

Suppose again that G/Gc2 is of order 4. Then G may be generated by ad- 
joining to K the elements in G which induce the inversion in E(2). Assume 
furthermore that the element z in G/H(2) induces the inversion in E(2) and 
satisfies (zy)?=y?"°" for every y in K/H(2) which is not in Gg,/H(2). Then 
it follows from Lemma IV.3.1(c) that every C2-character f of G/H(2) satisfies 


f(z)» = f([z, y]})*f(y)? = = f(y??? -*)"f(y)? 
= f(y? *-D fy)? f(y) f(y)? 
= f(y f(y)? = = 1; 


and it follows from Corollary III.1.3 that every C2-character of K/H(2) is 
induced by a C2-character of G/H(2) which maps z upon 1. If we denote 
by B, the group of all the C2-characters of G which map the coset z of G/H(2) 
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upon 1, then it is readily seen that the subgroup X, generated by the elements 
in the coset z and containing H(2), consists exactly of the elements in G 
mapped upon 1 by the C2-characters in B:, since K/H(2) is C2-complete. De- 
note by Q again the group of C-characters of odd order of G. Then H(2) is 
the set of elements mapped upon 1 by Q. The group B=QB, is not 2-com- 
plete; and thus it follows that B is not complete. This implies that the cross- 
cut of H(2) and X is different from 1; and we deduce X < H(2)H(2) from the 
fact that X/H(2) is of order 2, proving the necessity of condition (ii’’). 

We assume now conversely that the conditions (i) and (ii) are satisfied 
by G, C, and that B is a complete group of C-characters of G. This group B 
is the direct product of the groups B, and P where P consists of the C-char- 
acters of odd order in B whereas B, is the subgroup of all the C2-characters 
in B. We denote by R a group between K and G whose elements do not induce 
the inversion in E(2) and whose index in G is 2. If x is an element of order 2 
in R/H(2), then x is in K/H(2), since R/H(2) is cyclic. Consequently we may 
deduce from condition (i) that x is in (H(2)H(2))/H(2). There exists there- 
fore an element w#1 in H(2) such that x =H(2)w. From the completeness of 
B we infer the existence of a C2-character } in B, which does not map w 
upon 1. Hence b(x) =b(w) #1, showing that B, induces a complete group of 
C2-characters in R/H(2). Thus we may deduce from Theorem IV.2.2 that 
every C2-character of R/H(2) is induced by some C2-character in Bz. 

If s is an element of order 2 in G/H(2), and if s is in R/H(2), then we have 


already shown that B,(s) #1. If s is not in R/H(2), then we distinguish two 


cases. 
Case 1. s does not induce an inversion in E(2). Then there exists an element 


g generating R/H(2) modulo Ge:/H(2) and an inversion z in G/H(2) such 
that s=zg” for some i. Since s is not an inversion, g? is not in Ge2/H(2). 
Since s is of order 2, we find that 


1 = s? = (zg)? = * = [z, gt ’ 
= 2], 


since inversions are induced in E(2) by elements of order 2 in G/H(2), as 
follows from Theorem III.1.2. Thus it follows that the automorphism g” of 
E(2) is of order 2, that gt is therefore in K/H(2), and that the integer cor- 
responding under C2 to the element g* in K/H(2) is g*?=1+2™-1 
modulo 2”), 

Every C2-character of R/H(2) is induced by some C2-character in Bz. 
Thus there exists a C2-character f in B, such that f(#) =e!‘ for ¢ in R/H(2) 
where ¢ is an element generating E(2). If 0<i, then the order of g***’ does not 
exceed 2”(2—*! which is a divisor of 2"); and thus it follows from Corollary 
III.1.3 that f(z) is of the form e*+/"™, generates E(2)?. Consequently we 
find that f(g") = f(g") = f(z)!+2" £1, since the second factor 
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is of order 2, the first is of order 2", that is B,(s) #1. If, however, 1=0, 
then g? is in Ge2/H(2) and the order of G/Gez is 4. In this case we infer from 
condition (ii’) that s is in (H(2)H(2))/H(2); and there exists an element s’’ 
in H(2) such that s=H(2)s’’. Since B is complete, and since P(H(2)) =1, 
there exists a C2-character d in B, such that d(s) =d(s’’) #1; and we have 
shown again that B.(s) 1. 

Case *, An inversion is induced by s in E(2). Then we consider again an 
element g in R/H(2) which generates R modulo Ge2. There exists a C2-char- 
acter f in Bz such that f(¢) =e! for tin R/H(2) where the element e generates 
E(2). It follows from Lemma IV.3.1(c) that f(s)!~¢ =e*(*-); and this implies 
f(s) #1, provided the automorphism g of E(2) is not of order 2; B2(s) #1 pro- 
vided G/Ge: is not of order 4. 

If G/Ge: is of order 4, then it may happen that there exists a C2-char- 
acter f of R/H(2) such that f([s, g])*f(g)?#1. This C2-character is induced 
by some C2-character f in B,; and it follows from Lemma IV.3.1(c) that 
f(s)'-"¥1 so that again B,(s) #1. 

Suppose finally that 1=f([s, g])*f(g)? for every C2-character f of R/H(2). 
Then we note that the integer corresponding to g under C2 is g=1+2"@)-! 
modulo 2”), since g induces an automorphism of order 2 in E(2), and since g 
is in R= K. Consequently we find: 


S((sg)?g- = f(sgsg-tg? = f([s, g)) 
= g]) (since g? is in Ge2/H(2)) 
= f([s, g)) f(g) 
= f([s, f(g)? = g)*f(g)? = 1, 


since f(|s, g]) is of an order dividing 2”)-', and we infer from the C2-com- 
pleteness of G/H(2) that R/H(2) is C2-complete, and that therefore 
(sg)?=g?". Since g may be any element in R/H(2)=K/H(2) which is 
not in Ge2/H(2), we may apply condition (ii); and it follows that s is in 
(7(2)H(2))/H(2); and there exists an element so in H(2) such that s = H(2)so. 
Since B is complete, we have B(so) #1. Since so is in H(2), we have Q(so) = 1. 
Hence there exists a C2-character } in Bz, such that b(s) =b(so) #1, proving 
again that B,(s) #1. 

Thus we have finally shown that B,(s) 1 for every s in G whose order 
modulo H(2) is exactly 2; and that proves the 2-completeness of B, as was to 
be shown. 

For the enunciation of the next theorem we need a restatement of a con- 
cept introduced before (cp. Lemma IV.3.4!): If G/Ge2 is not cyclic, then there 
exists one and only one subgroup A = A(G, C) which contains Gz, is of index 2 
in G, does not contain any element inducing the inversion in E(2), and con- 
tains an element such that the integer corresponding to it under C, is congru- 
ent to —1 modulo 4. 
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THEOREM IV.4.5. If G is C-complete, then the following conditions are neces- 
sary and sufficient for the full C-character group of G to be the only complete 
group of C-characters of G: 

(a) If G/Gep ts. cyclic, then H(p)H(p) contains every element in G whose 
order modulo H(p) ts p. 

(b) If G/Gep is of order prime to p, and if the element x in G is modulo H(p) 
of order a prime number different from p, then H(p)H(p) contains an element y 
which is not in H(p), but satisfies xy=yx modulo H(p). 

(c) If G/Ges is not cyclic, then H(2)H(2) contains every eiement in the sub- 
group K (introduced in Theorem 1V.4.4) whose order modulo H(2) is 2. 

(d) If p is odd and G#Gcp, then either m(p) =1 or the order of G/Gc, is 
divisible by p. 

(e) If G/Ge is of order 2, and if there exists an element in G to which there 
corresponds under C2 an integer congruent to —1 modulo.4, then H(2)Gto 
<H(2)G’. 

(f) If G/Gee is not cyclic, and if the order of G/Gcez is divisible by 8, then 
H(2) G’ is not part of A*H(2). 

(g) If G/Gez is not cyclic, but of order 4, then neither H(2)G' nor H(2)A? 
is equal to H(2)G?. 


Proof. It is readily seen that the following two conditions are necessary 
and sufficient for the full C-character group of G to be the only complete 
group of C-characters of G: } 

(A) Every complete group of C-characters of G is p-complete, for every 
prime p. 

(B) The full Cp-character group of G (or of G/H(p)) is the only complete 
group of Cp-characters of G/H(p), for every prime p. 

It is an immediate consequence of Theorems IV.2.2 and IV.3.7 that the 
conditions (d) to (g) of the theorem are equivalent to the property (B); 
and it is a consequence of Theorems IV.4.3 and IV.4.4 that (A) implies con- 
ditions (a) to (c). If conversely all the conditions (a) to (g) are satisfied by 
G and C, then the conditions (p*) and (p**) of Theorem IV.4.3 are conse- 
quences of (a) and (b) respectively; and condition (i) of Theorem IV.4.4 is 
an immediate consequence of the present condition (c). Condition (ii’’) of 
Theorem IV.4.4 is satisfied, since we may infer from (g) the impossibility of 
the existence of an element z meeting the requirements of condition (ii’’) 
of Theorem IV.4.4 considering that 2 <m(2), if G/Gc2 is not cyclic. Condition 
(ii’) of Theorem IV.4.4 is a consequence of condition (c) of the present theo- 
rem, since elements of order 2 in G/H(2) which do not induce the inversion 
in E(2) belong by Theorem IV.4.5(i) and Lemma IV.3.4 to K/H(2), if the 
order of G/Ge:2 is 4 and G/Ge: is not cyclic. Thus it follows from Theorems 
IV.4.3 and IV.4.4 that condition (A) is a consequence of the conditions (a) 
to (g); and this completes the proof. 
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CHAPTER V. DUALITY 


A projectivity of the group S upon the group T is a biunivoque corre- 
spondence which maps the set of all the subgroups of S upon the set of all the 
subgroups of T and which has the property of preserving the relation “3.” 
A duality of the group S upon the group T is a biunivoque correspondence 
which maps the set of all the subgroups of S upon the set of all the subgroups 
of T and which has the property of inverting the relation “ $.” If the product 
of two dualities exists, then it is a projectivity. 

If G is a group and C a homomorphism of G into the group of automor- 
phisms of the cyclic group E of order m, then we denote the C-character group 
of G by L=L(G, C). This group L is an abelian group the orders of whose 
elements are divisors of m; and thus it follows from Theorems I.1.3 and 1.1.4 
that ZL and the group Lp» of its Co-characters in E are isomorphic groups; 
and that there exists a duality between Z and Lo. Thus the problems of con- 
structing a duality between G and L and that of constructing a projectivity 
between G and Lo are equivalent problems. 

V.1. The natural projectivity between G and Ly. Throughout this section 
we make use of the notations (II.3.*). In the proof of Theorem IV.1.2 we 
have introduced the operator F,, defined by the equation F,(f) =f(g) for g an 
element in G and f a C-character of G. 


LemMA V.1.1. Suppose that G is C-complete. 


(a) F, is a Co-character of the C-character group of G, that is F, is, for every g 
in G, an element in Lo. 

(b) If s and t are elements in G such that F,= F;,, then s =t. 

(c) If Sis a subset of G such that the set Fs is a subgroup of Lo, then S is a 
subgroup of G. 


REMARK. Note that Fg need not be a subgroup of Lo, though Fg is always 
a subset of Lo. In this respect see Theorem V.1.2. 

Proof. Statements (a) and (b) are easily verified; see for example (1.3.5.1) 
and (1.3.5.2). Suppose now that S is a subset of G such that Fs is a subgroup 
of Lo, and that s, ¢ are elements in S. Then the characters F,, F,, Fi and FF, 
belong to Fs. If f is any C-character of G, then FiF,(f) = (F.(f))'F.(f) =f(s) f( 
=f(st) = F,,(f); and it follows from (b) that s# is in S, proving (c). 


THEOREM V.1.2. If G is C-complete, then each of the following properties 
implies the others: 

(1) The set Fg is a group of Co-characters of L(G, C). 

(2) C is regular. 

(3) Fs is a group of Co-characters of L(G, C) tf, and only if, S is a sub- 
group of G. 
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REMARK. It is readily deduced from Lemma V.1.1 that Fs is a projectivity 
of G upon Lp if, and only if, (3) holds true. 

Proof. It is obvious that (1) is a consequence of (3). If (1) is satisfied by G, 
then Fg is a complete group of Co-characters of L(G, C), since F,(f) =1 for 
every g implies f(g) =1 for every g, that is, implies f= 1. Hence it follows from 
Theorem I.1.4 that Fg is the group Lp of all the Co-characters of L(G, C) 
in E. It follows from Theorem I.1.3 and Lemma V.1.1(b) that G, Fg=Lo, 
and L are of equal order, and it follows from Corollary I1.5.3(b) that (2) is a 
consequence of (1). Suppose finally that (2) is satisfied by G and C. Then we 
deduce’ from Lemma-V.1.1(c) that we need prove only the following fact: 
Fs is a group, whenever S is a group. If S is a subgroup of G, then S is C-com- 
plete and satisfies the same condition (2) as G. Hence it follows from Corol- 
lary II.5.3(b) that S and its C-character group L(S, C) are of equal order. 
Since L(G, C) induces in S a complete group of C-characters, and since it 
follows from Theorem IV.1.2 that the C-character group of S is the only 
complete group of C-characters of S, we find that every C-character of S is 
induced by some C-character of G. If we denote by Ls the group of all the 
C-characters of G which map S upon 1, then it follows that L(S, C) is essen- 
tially the same as L(G, C)/Ls; and since L(S, C) and S, L(G, C) and G are 
of equal order, we deduce that the order of Ls is the index [G:S] of S in G. 
Denote by L§, the group of all the Cy-characters of L(G, C) in E which map 
Ls upon 1. It is obvious that Fs SL§, since f(S) =1 and Fs(f) =1 are equiva- 
lent statements. It is a consequence of Theorems I.1.3 and I.1.4 that the 


order of Lj is the index of the subgroup Ls of L(G, C) in L(G, C). Since G 
and L(G, C) are of equal order, and since the order of Ls is [G:S], it follows 
that S and Ls are of equal order, proving that Fs= Lj, since Fs contains by 
Lemma V.1.1(b) as many elements as S. Since Lj is a group of Co-characters 
of L(G, C), it follows now that Fs is a group of Co-characters of L(G, C), as 
was to be shown. 


THEOREM V. 1. 3. The C-complete group G and its C-character group L are 
isomorphic if, and only if, G is abelian and C is regular. 


Proof. If G and L are isomorphic, then G is commutative, since L is com- 
mutative. Both groups furthermore have equal order and the regularity of C 
is a consequence of Corollary II.5.3. 

Suppose conversely that G is commutative and that C is regular. Then it 
follows from Theorem V.1.2 that a projectivity of G upon Lp is effected by 
mapping the subgroup S of G upon the subset Fs of Lo. Since S and Fs are 
groups of equal order, it may be deduced from a well known theorem(*’) that 

(?") See Rottlinder, Math. Zeit. vol. 28 (1928) pp. 641-653 or R. Baer, Amer. J. Math. 


vol. 61 (1938) p. 30; one may verify this fact readily by showing the equality of the invariants 
of G and Lo. 
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G and Ly are isomorphic. We infer from Theorem I.1.3 that Z and Lp are iso- 
morphic, proving that G and L are isomorphic too. 

V.2. The natural duality between G and L. The following procedure is 
customary for setting up a duality between groups and their character 
groups: using the notations (II.3.*) and assuming that G be C-complete, 
we denote by (L, S), for S a subgroup of G, the set of all the C-characters 
of G which map S upon 1; and we denote by (G, T), for T a subgroup of the 
C-character group L of G, the set of all the elements x in G, satisfying 
T(x) =1. Clearly (LZ, S) is a subgroup of Z and (G, T) a subgroup of G; and 
it is readily verified that these correspondences effect dualities if, and’ only if, 


(V.2.1) (LZ, (G, T)) =T and (G, (L, S)) =S for every subgroup S and T of G 
and L respectively. 


It is our object to determine those G, C which meet the requirement 
(V.2.1). It is easy to see for example that the groups G, satisfying condition 
(3) of Theorem V.1.2, have the property (V.2.1). 


THEOREM V.2.2. If G is C-complete, then the following conditions are neces- 
sary and sufficient for G, C to satisfy (V.2.1): 

(i) The orders of H(p) and of G/H(p) are relatively prime. 

(ii) If the order of G/Gc, is not a power of p, then m(p) =1 and the order of 
G/Ge, is a prime (dividing p—1). 

(iii) C2 ts regular. 


Proof. Suppose first that (V.2.1) is satisfied by G and C. Denote by L, the 
group of Cp-characters of G. Then H(p) =(G, L,) is nothing but a restatement 
of the definition of H(p). We deduce from (V.2.1) that L,=(L, H(p)) and 
that therefore the full Cp-character group L, of G/H(p) is the only complete 
group of Cp-characters of G/H(p). Hence we infer from Theorem IV.2.2(1) 
that either the order of G/Gc, is divisible by p or G=Gc, or m(p) =1. Denote 
by P, the Cp-principal genus of G (or G/H(p)). Then Gc,=(G, P,) and we 
deduce from (V.2.1) that P,=(L, Gey). Applying this last result on p=2, 
we infer from Theorem II.3.2 that C2 cannot be singular. Every G/Gc, is 
therefore cyclic. Since P, is by Theorem II.2.1 a cyclic group of order a power 
of p, we may infer from (V.2.1) that G/Gey» is a cyclic group of prime power 
order, containing as many subgroups as does P,; and the necessity of the con- 
ditions (ii) and (iii) is now an immediate consequence of Theorem II.2.1. 

Since L is the direct product of all the L,, we find that H(p) =(G, [] p«eZ); 
since the cross-cut of L, and [| pe, is 1, then G is the product of H(p) 
and H(p) (for every p); and this proves that G is the direct product of 
all the H(p). Since the orders of L, and] p«¢Z, are relatively prime, it follows 
that every subgroup T of L is the cross-cut of TL, and of T] | pxel,; and hence 
it follows from (V.2.1) that every subgroup S of G is the product of its 
cross-cut with H(p) and of its cross-cut with H(p), proving that the orders of 
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H(p) and H(p) are relatively prime; and thus we have verified the necessity 
of (i). 

Suppose conversely that the conditions (i) to (iii) are satisfied by G and C. 
Then we infer from (i) and Corollary I1.4.4 that G is the direct product of its 
subgroups H(p); and that G is, for every prime p, the direct product of H(p) 
and H(p). 

Consider a subgroup S of H(p). Since every C-character of G maps H(p) 
into E(p), it follows that H(p) and S are mapped upon 1 by every Cg-char- 
acter for g¥; and since G is the direct product of H(p) and H(p), it is read- 
ily seen that the C-characters of G induce in H(p) a group of Cp-characters 
which is essentially the same as the group L, of the Cp-characters of G. 
Thus (L, S)=(L, where Lj and where (L, S), is the group 
of Cp-characters of G which map S upon 1; and this shows that (G, (Z, S)) 
is exactly the cross-cut of (G, (L, S),) and H(p), that is (G, (LZ, S)) consists 
of those elements in H(p) which are mapped upon 1_by the Cp-characters 
in (ZL, S). If the order of G/Gc, is a power of », then we infer from (iii) the 
regularity of Cp and it follows from Theorem V.1.2 that S=(G, (ZL, S)). 
If the order of G/Gce, is not a power of p, then it follows from (ii) that 
m(p)=1 and that G/Gc, is of order a prime p*#p. Consequently either 
SSH(p)cp or H(p)=SH(p)cp; in the first of these cases our contention 
S=(G, (LZ, S)) is an immediate consequence of Theorems I.1.4 and I.3.1; 
in the second of these cases we infer from Lemma I.3.4 that every Co-char- 
acter of H(p)c, in E(p) which maps S¢, upon 1 is induced by a Cp-character 
in (L, S),; and S=(G, (L, S)) is a consequence of Theorem I.1.4 

If S is a subgroup of G, and if S, is the cross-cut of S and H(p) then S 
is the direct product of the S,, since the orders of H(p) and H(p) are rela- 
tively prime by (i). If f is a C-character of G, then f is the product of uniquely 
determined Cp-characters f,. Since f,(H(p)) =1, it follows that f(S)=1 if, 
and only if, f,(S,) =1 for every p. This shows that (L, S) is the direct product 
of the groups (L, S,), consisting of all the Cp-characters of G which map S, 
upon 1. But we have shown in the preceding paragraph of the proof that 
S,=(G, LJ (L, and this makes the desired identity, S=(G, (L, S)) 
evident. 

If T is a subgroup of the group L, of the Cp-characters of G, then (G, T) 
contains H(p), since L,(H(p)) =1. If f is a Cg-character of G for g#p which 
maps H(p) upon 1, then f maps H(p) upon 1 too; and this implies f=1, 
since G is the direct product of H(p) and H(p). Thus it follows that (L, (G, T)) 
is a group of Cp-characters of G. If the order of G/Gc, is a power of p, then 
it follows from (iii) and Theorem V.1.2 that T=(L, (G, T)). If the order of 
G/Gc, is not a power of p, then it follows from (ii) that m(p) =1 and that 
G/Gez is of order a prime p* ¥p. If the principal genus P, of G/H(p) in E(p) 
is part of T, then (G, T) is between H(p) and Ge,; and it is readily verified 
that T=(L, (G, T)) by Theorems I.1.4 and I.3.1 and the fact that (Z, (G, T)) 
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is a group of Cp-characters. Thus we assume finally that P, is not part of T. 
We note that P, is of order p, and that every Cp-character of G different from 
1 is of order p, since E(p) is of order p. We wish to show that (G, T) containsan 
element not in Gey. Denote by g an element in G/H(p), not in-Gc,/H(p). Then 
g generates G modulo Ge», since G/Gc, is of order a prime p*; and it is a con- 
sequence of the Cp-completeness of G/H(p) and of I1.1.A and Theorem I.3.1 
that p* is the order of g. Our object is attained if g is in (G, T); and thus we as- 
sume that g is not in (G, T). We denote by 7* the subgroup of those Cp-charac- 
ters in T which map g upon 1; since E() is of order p, and since the Cp-charac- 
ters of G/H(p) therefore map g upon different values, it follows that T/T* 
is of order p. Since P, is of order p, and since P, is not part of 7, it follows 
that the cross-cut of T and P, is 1; and hence it follows from Theorem 
II.2.1(a) that different Cp-characters in T induce different Co-characters of 
Gce,/H(p) in E(p). Hence there exists an element in Gce,/H(p) which is 
mapped upon 1 by 7*, but not by T. Since the elements in Gc,/H(?) are 
of order p, this implies the existence of an element y in Ge,/H(p) such that 
the Cp-characters in T have the same values on g and on y. Consequently 
gy~! is mapped upon 1 by the Cp-characters in T so that the elements in the 
coset gy~! are in (G, T), though not in Gey. Denote now by w any element in 
(G, T), not in Ge». If f is a C-character in (L, (G, T)), then f is a Cp-character 
of G; and there follows from Theorem I.1.4 the existence of a Cp-character f* 
in T which coincides with f on Gc». But both f and f* map w upon 1; and G is 
generated by adjoining w to Ge». Hence f=f* is in T, proving the desired 
equation T =(L, (G, T)). 

Consider now any subgroup T of L; and denote by T, the group of the 
Cp-characters in T. Then T is the direct product of the subgroups T,; and 
(G, T) is the cross-cut of the groups (G, T,). Since (G, T,) contains, as has 
been remarked before, the subgroup H(p), it follows that (G, T,) is the direct 
product of H(p) and of a subgroup U, of H(p), as G is the direct product of 
H(p) and H(p). Since U, is part of every H(q) for q¥>, it follows that U, 
is part of (G, T). If f is a Cp-character in (L, (G, T)), than f maps both H(p) 
and U, upon 1 so that f belongs to (L, (G, T,)). But we have shown in the 
preceding paragraph of the proof that T,=(L, (G, T,)) showing that f is 
in 7; and now it is obvious that T=(L, (G, T)), as was to be shown. 
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The present note deals with spaces in which geodesics exist and are 
uniquely determined by any two of their points. 

The existence of geodesics without additional topological or differentiabil- 
ity conditions finds its expression in the Axioms A, B, C, D below. It may 
be helpful to point out that a symmetric variational problem in parametric 
form will satisfy these axioms when the extremals are considered as geodesics. 
It will appear that there are only two different types of spaces in which the 
geodesic through any two distinct points is unique, similar to the two familiar 
examples of the euclidean and the elliptic spaces. Namely, either the space is 
simply connected and all geodesics are congruent to a euclidean straight line, 
or the Space = is not simply connected and all geodesics are congruent to one 
euclidean circle (with the length of the shorter arc as distance). Then 2 pos- 
sesses a two-sheeted universal covering space 2, which also satisfies the Axioms 
A, B, C, D and which shares the following property with an ordinary sphere. 
All geodesics of 21 are congruent to one euclidean circle, and the geodesics passing 
through a given point X, of 2, all pass through the other point X{ of 2, which 
lies over the same point of = as Xi. 

This theorem will be applied to obtain improvements on the theory of 
spaces with convex spheres as developed in chap. 4 of the author’s Metric 
methods in Finsler spaces and in the Foundations of geomeiry(*). Finally the 
two-dimensional case will be discussed. 

1. Sphere like spaces. The axioms which guarantee the existence of geo- 
desics in the space 2 are these: 


A. = ts metric. 


The distance of the two points A, B will be denoted by AB. We use the 
notation (A BC) to express that the three points A, B, C are different and that 
AB+BC=AC. 


B. = is finitely compact, that is, any bounded sequence of points has an ac- 
cumulation point. 


Presented to the Society, September 13, 1943; received by the editors September 27, 1942. 

® Annals of Mathematics Studies, no. 8, Princeton, 1942. This book will be referred to 
as B. It contains the references to the literature, which will therefore be omitted here. But it 
should be stated explicitly that in the special case of flat spaces our main result was established 
long ago by Hamel (see Hamel [2] in the bibliography of B). Moreover, it has been shown al- 
ready in B that if the geodesic through any two distinct points is unique, each single geodesic 
is congruent to a euclidean circle or to a euclidean straight line. 
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C. = is convex, that is, for A#C, there is a point B with (ABC). 


D. Every point P of = has a neighborhood v with this property: For any two 
points A~B in v and a given e>0 there is a positive 5(A, B) Se to which there 
corresponds a unique point By; with 


(ABB;) and BB; = 6. 


We shall now compile those consequences of Axioms’ A, B, C, D which 
will be needed in the present note. For the proofs the reader is referred to B. 

A point set 7 is called a segment if it is congruent to an interval [a, 5] 
of the real axis, that is, there is a mapping t—>P(t) of [a, 6] on 7 such that 
P(t:)P (tz) =|ti—t2| for any ¢; in [a, 6]. P(t) is called an isometric representa- 
tion of r. The points P(a) and P(d) are the end points of r. Any two points 
are the end points of at least one segment. t(PQ) will denote a segment from 
P to Q. The following trivial property of segments will be used frequently. 

(1) Lf (ABC) and o,7 are segments t(AB) and t(BC) respectively, then o+-7 
is a segment t(AC). 

When the spherical neighborhood v(P, p) of P (consisting of those points 
X for which PX <p) satisfies Axiom D and 0<jSp/6 then v(P, A) is a so- 
called standard neighborhood of P, that is, it has the following properties: 

(2) For any two distinct points A, B in v(P, p) and a given 0<t <1 there is 
exactly one point C with (ACB) and AC=t-AB. And for any given 0<sS25 
there is exactly one point D with (ABD) and BD=s. 

A point set g is called a geodesic if it is a map t—>P(¢) of the real axis 
— «0 <t<o such that for every real to a positive ¢(t9) exists for which the 
subarc | t—to| S(t) of P(t) represents a segment isometrically. P(¢) is called 
an isometric representation of g. 

(3) For a given segment t(PQ) with an isometric representation P’'(t), 
astsb, a<b, there exists one and only one geodesic g which contains t(PQ), 
and g admits an isometric representation P(t) such that P(t)=P'(t) fora Sts. 

The simplest geodesics are the straight lines. They are by definition con- 
gruent either to a euclidean straight line or to a euclidean circle. More ex- 
plicitly, if P(¢) represents a straight line isometrically then for any real 4, ts 
either 

(4a) P(t) P(t2) =|ti—t2| (open straight line) 
or an a>0 exists such that 

(4b) P(t:)P (te) =min ,v=0, +1, +2, - - - (dosed straight line). 
In the latter case g is congruent to a euclidean circle of radius a/27. Hence a 
is called the length A(g) of g. For any point A of g there is exactly one point A’ 
on g (the conjugate point to A on g) for which g contains two segments t(AA’). 

The point A’ may also be characterized as the point on g for which no 
point Y with (AA’Y) exists. For, if A’#ACg then (AAY) with Y=A’. If 
a point Y with (AA’ Y) existed, it would obviously not lie on g. On the other 
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hand, it follows from (1) that a segment r from A to A’ on g forms together 
with any segment (a'r) a segment 7’ from A to Y. Let g’ be the geodesic 
that contains r’. Then g '*g because YC{g. But (3) and the fact that r is 
contained both in g and g’ would yield g=g’. 

A straight line contains with any two points P, Q also a segment t(PQ). 
The converse holds too, so that we may state 

(5) A geodesic is a straight line if, and only if, with any two points P, Q 
it also contains a segment t(PQ). 

In this form the theorem is not in B. However the proof of Theorem 3, 
chap. 1, §2, in B can easily be modified to yield this result. 

As a consequence of (5) we have 

(6) If there is only one geodesic through any two distinct points, then all 
geodesics are straight lines. 

Namely, if g is any geodesic and A, B two different points on g, then a 
geodesic g’ exists containing a given segment t(AB) (compare (3)). Since, by 
hypothesis, g=g’, it follows that g contains a segment t(A B), and hence must 
be a straight line by virtue of (5). 

A space which satisfies the hypothesis of (6) is called an S.L. space, in 
particular an open S.L. space if all straight lines are open, a closed S.L. space 
if they are all closed. 

As stated in the introduction, the main purpose of this note is to show 
that all S.L. spaces are either open or closed, and that in the latter case all 
straight lines have the same length. 

In any space with Axioms A to D the straight lines have this geepenty: 

(7) If two straight lines g and h have two different common points A and A’ 
then either g=h or else g and h are closed, have the same length, and A and A’ 
are conjugate on both g and h. 

For if g is open or closed and A’ not conjugate to A, then g contains a 
point Y with (AA’Y) and a segment @ from A’ to Y. The geodesic / contains 
a segment fr from A to A’. By virtue of (1) r+¢ is a segment t(A Y). Using (3) 
twice we find that the geodesic which contains r+o¢ must coincide with / 
because it contains r and with g because it contains o. Hence, if g#h, the 
points A and A’ are conjugate on both lines, so that A(g) =A(h) =2AA’. 

Let the space = satisfy Axioms A, B, C, D. If all geodesics of 2 are closed 
straight lines and if all geodesics which pass through a given point A pass 
through a fixed point A’, the conjugate point to A in 2, we call = sphere like. 
(The word spherical will be reserved for spaces congruent to a euclidean 
sphere.) 

(8) Let = be a sphere like space. Then all geodesics have the same length. 
The conjugate point A to A’ in = is the conjugate point to A on every geodesic 
through A. The mapping A—A’ of = on itself is an isomeiry. The geodesic 
through A and B is unique, when A and B are not conjugate. 
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The first two assertions follow from (7). 

There is a geodesic g through A and B. Any geodesic g’#g through A 
intersects g, besides at A, only at A’, hence g is the only geodesic through A 
and B. If we denote generally the conjugate point to X by X’, the isometry of 
the mapping A—A’ may be written as 

(9) XY=X'Y’. 

For Y=X or Y=X’ the relation (9) is obvious. When YX, X’ the points 
X and Y are not conjugate. Hence the geodesic through X and Y is unique 
and contains both X’ and Y’. Since XX’ = YY’ and g is a closed straight line 
of length 2X X’ we see that XY=X’'Y’. 

We show next that every sphere like space is covering space of a closed 
S.L. space. 

THEOREM 1. The non-ordered pairs X =(X, X’) of a sphere like space = 
form with the metric 

(10) X¥¥=min (XY, XY’) 

a closed S.L. space 5 in which all straight lines have length XX". 

Proof. Obviously XX =0. If ¥¥Y, then X¥ Yand X¥ Y’, hence XY >0. 

Moreover (9) yields 


XY = min (XY, XY’) = min (YX, YX’) = YX. 


The triangle inequality ~ 
(11) X¥+VYZ2XZ 


follows from (9), (10) and these 4 inequalities: 
XY+Y¥Z2XZ, XZ = XZ’, 
XY + = XY’ + YZ’ = + X'Z' = XZ, 


which also show that the equality sign holds in (11) only if X, Y, Z are on 
one geodesic in 2. 

We see that 5 satisfies Axiom A. Clearly, Axiom B holds too. The metriza- 
tion (10) applied to the points X of 2 which belong to a fixed geodesic g 
(the point X’ lies then also on g) amounts to identifying diametrically op- 
posite points on a circle, and yields therefore a closed straight line in = with 
length XX’. Hence any two distinct points X, Y in 5 are on at least one closed 
straight line. This line contains a segment from X to Y, consequently > satis- 
fies Axiom C. 

For XY<XX'/2 we have 

(12) X¥ =min (XY, XY’)=min (XY, Y¥Y’—XY)=XY. 

Any two points X, Y of a spherical neighborhood v(P, p) in 2 have distance 
XYSXP+PY<2p. Therefore it follows from (12) that the mapping XX 
maps the neighborhood »(P, XX’/4) of P in = isometrically on the neighbor- 
hood v(P, XX‘'/4) in 5. Hence Axiom D holds in 5. 
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The characteristic property of a geodesic to be locally a segment shows 
together with the local isometry of the mapping X—>X that each geodesic 2 
of = consists of those points X=(X, X’) for which X belongs to a fixed 
geodesic in 2. We have seen already that these points form a closed straight 
line of length XX’. Since the geodesic through two not conjugate points X 
and Y in is unique, it follows that the geodesic through X and Y in > is 
unique, as soon as X¥Y. 

By means of Theorem 1 results on closed S.L. spaces lead to results on 
sphere like spaces. For instance Theorems 7 and 8 in B, chap. 5, §1, yield the 
following: 

A sphere like space is spherical (that is, congruent to a euclidean sphere) 
if it can be reflected in each of its straight lines. 

A symmetric sphere like space is spherical. 

2. The main theorem. It is easy to see that an open S.L. space is simply 
connected. The segment t(X Y) connecting two arbitrary points X and Y is 
unique and depends continuously on the end points X and Y. Let X(t), 
a3t3sb, be any continuous curve and P any point. We denote by X,(¢), 
0 Sa 31, the unique point on t(PX(¢)) for which PX,(¢) =aPX(t). The point 
X,.(t) depends continuously on both ¢ and a, moreover Xi(¢) = X(t), Xo(t) =P. 
Hence varying a from 1 to 0 we obtain a deformation of X(t) into P. 

Similarly we see that a sphere like space = is simply connected. For let 
X(t), aSt3b, be any continuous curve which does not cover all of Z, and P’ 
any point not on X(é); finally let P be the conjugate point to P’. Then the 
segment (PX (4)) is unique for every X (4) : and we may proceed as before. 

We are going to show that an S.L. space 5 which is not open is not simply 
connected. This may be accomplished either by showing that a closed straight 
line cannot be contracted to a point, or by constructing a covering space 2 
different from 5. We shall use the second method because it will yield di- 
rectly a two-sheeted sphere like space 2. Since on the one hand 2 is the mini- 
mum number of sheets and on the other hand a sphere like space is simply 
connected, we see that = is, topologically, the only covering space of 5. The 
original space is obtained from Z as in Theorem 1, which implies that all 
geodesics in > are closed straight lines of the same length. 

Let the S.L. space 5 contain at’ least one closed geodesic Z, and let A 
be an arbitrary point on g. The unique geodesic through the two different 
points X, Y of > will be denoted by g(XY). When D¥A and g(AD) is closed, 
then any geodesic g(AX) with XC»(D, ¢) is closed provided e>0 is suffi- 
ciently small (compare B, chap. 3, §1, Theorem 3). 

Call J the locus of the conjugate points to A on the different closed geo- 
desics through A. Finally let »(A, p) be a standard neighborhood of A (com- 
pare (2)) and x its boundary, that is, the locus XA =p. 

We are going to define a mapping X—>X* of 5—f on v(A, p). We put 
A*=A. If A~XCZ-—A, the segment t(AX) is unique, hence there is exactly 
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one point X* with (AX*X) and 
A*X* = p(AX/(1 + AX))((2 + Ag(AX))/dg(AX)), 
where \g(AX) is the length of the geodesic g(AX) and 
(2 + = 1 if \g(AX) = 


Because the length of a geodesic depends continuously on the geodesic (see 
B, chap. 3, §1) the mapping X—>X* is one-to-one and continuous. Call «* the 
subset of those points X* of % for which g(A.X*) is closed. We map the point 
X* of x* on the conjugate point X to A on g(AX*). We then obtain a con- 
tinuous mapping a of 2* =v(A, p)+x* on all of 5. Each point X of B has two 
images X* and X#, which are diametrically opposite points of K. — 

We now assign a definite neighborhood u(X*) to every point of 2* as 
follows: If X is not on B we let u(X) be a standard neighborhood of X which 
does not meet §, and u(X*) the image u(X)a~! of u(X). If XC# and X# and 
X# are its two images we choose a positive 7<p/2 so that (AZ): is closed 
for ZCv(X#, n) or ZCv(X#, 9). We choose a standard neighborhood u(X) 
of X such that g(AY) meets v(X¥#, 7) and »(X#, n) when YCu(X). (This is 
possible because g(A X,)—>g(AX) when X,->X.) Let i=1, 2, be the set of 
those points Y in u(X) for which a point Z; in »(X¥#, n) with (YZ;A) exists. 
We have #+%#,=u(X), and and have only the points Y of in 
common. For if (YZ;A) and Z;Cv(XF, n), i=1, 2, then 


Z:Z2 > X*#X#* — 2n = 2p — 2n > p> 2n. 


If Y were not conjugate to A, the segment t(AY’) would be unique and con- 
tain both Z,; and Z:. But then 


= | AZ, — AZ2| < (9 + n) — (p — 2) = 2n. 


The image u(X#*) of @; we define as neighborhood of X¥. 

Now let 2** be congruent to 2* and designate by X** the image vil x* 
under a definite congruent mapping yu of =* on >*. Let X# and X# be any 
two diametrically opposite points of x*. We then identify X* with X7* and 

** with X#. The sum of 2* and 2** will be called 2. The mappings a 
and ye define together a continuous mapping a of = on 3. Under a-! every 
point X of > has exactly two images X* and X** in 2. We say that X* and 
X** lie over X. 

With every point of 2 we also associate a neighborhood. Namely if ¥¢e 
then we take the previously defined u(X*) as neighborhood of X* and 
u(X**) =u(X*)p as neighborhood of X**. If XCB we take u(X#*)u+u(X#) 
as neighborhood of X#*=X# and u(X*)+u(X#)u as neighborhood of 
X*#=X#*. The neighborhoods u(X*) and u(X**) of the two points X* 
and X** over X are disjoint and @ maps both u(X*) and u(X**) homeo- 
morphically on the standard neighborhood u(X) of X, - isa" 
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We metrize 2 in the following manner: For a continuous curve ¢ with a 
representation P(t), 031, in 2 we define the number X(c) as the length 
of the image P(t) =P(t)a in 5 and put for any two points X, Y in = 

(13) X Y=inf X(cxy), 
where cxy traverses all continuous curves from X to Y. X Y is always defined 
and finite. For, any point of X* of =* (or X** of 2**) can be connected to A* 
(or A**) by a path whose image is one of the possibly two segments from 
X to A, and the points A* and A** can be connected by a curve whose 
image is a given closed straight line through A in >. 

The relations XX =0, XY=YVX, XY+YZ2XZ are obvious and so is 
XY>0, for ¥¥Y (we put generally Za=Z). If X=Y the relation XY>0 
follows from u(X*)-u(X**) =0. 

We show next that with this metrization of 2 the mapping a is locally 
isometric. We see from the definition of A(c) and XY that 

(14) XVY2XY. 

If u(X) =v(X, 5) we put »(X) = v(X, 5/2). Under the homeomorphic mapping 
a of u(X) on u(X) where X stands for a definite one of the two images of X, 
the set v(X) may correspond to v(X). Let A, BCv(X). Since A, BCv(X) the 
(unique) segment t(AB) lies completely in u(X) so that the image of t(AB) 
in u(X) has the same length as t(AB), that is AB. Hence it follows from (13) 
and (14) that 

(15) AB=AB for A, BCv(X). 

We conclude from (15) that v(X) satisfies the requirements of D for the 
point X; since X was an arbitrary point of = we see that Axiom D holds. 

Because of the local isometry (15) the number A(c) equals the length of c 
as curve in 2. 

Axiom B now follows readily. For if PX,<d then PX,<d because of (14). 
Hence {X,} contains a subsequence {X,,} converging to a point X in 2. 
(15) implies that with proper notations the images {X,*} and {X,**} of 
{X,,} tend to the images X* and X** of X respectively. The sequence { X,, 
contain infinitely many points of at least one of the two sequences {X,,* 
and {X,2*}, which proves Axiom B. 

To prove Axiom C we observe that because of B and (13) a curve cxz 
exists which connects two given points X and Z and such that 


XZ= X(exz) 


(compare Property Le in B, chap. 1, §1). The additivity of the arclength and 
the triangle inequality yield in the well known fashion that XY+ YZ=XZ 
for every point of cxz. 

We conclude from the local isometry (15) of = and § that the image 
a= ga of any geodesic in 2 is a geodesic in >, and that at least one geodesic 
gin > lies over every geodesic Z in 5. 

Moreover, if g and h both lie over Z and have a common point P, then 
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g=h, because g and h coincide in a neighborhood of P (compare (3)). It fol- 
lows that there is only one geodesic which lies over a given geodesic Z of 3. 
For if g; and gs lie over 2 we select a point G;on g; such that G,~G:. There is a 
geodesic k through G; and G:. Its image & intersects g at both G, and Gz, 
hence =, and it follows from k-g; DG; that k=g; so that also g:=g2. We 
may now conclude 

(16) If a geodesic g of = contains X*, then g also contains X**. 

For g lies over a geodesic 2 through X, and there is a geodesic g: over Z 
through X**. Hence g=g, )X**. Before we apply this result to the present 
problem we notice a consequence which will be important in the theory of the 
convexity of spheres. 


THEOREM 2. In a closed S.L. space each geodesic 3 intersects the locus B(A) 
of the conjugate points to an arbitrary point A (possibly ZCB(A)). 


Returning to our spaces 2 and 5 we see that the geodesics through A 
must all be closed, for an open geodesic would have two images, one in the 
interior of =* and one in 2**. 

Since the length \(Z) of a geodesic g depends continuously on 2, and all 
geodesics through A are closed, the lengths of the geodesics through A are 
bounded. Hence the space > is bounded and therefore compact. We show next 

(17) Each geodesic g in = is a closed straight line. 

Since 5 is compact = is compact, hence open straight lines cannot exist. 
Because of (5) it is sufficient to show that g contains with any two points 
X, Y also a segment t(X Y). Let X#Y. Consider a segment t(X Y) and a 
geodesic h which contains this segment t(X Y). Then g and & both contain 
X and Y, hence g=h and g=h, so that g contains t(X Y). Now let X¥=Y, 
and consider a sequence of points Y,# Y on g which tend to Y. For large v 
we have Y,~ X. Hence g contains a segment t(X Y,). A suitable subsequence 
of {t(X Y,)} tends to a segment t(X Y). As the image of Z the geodesic g is 
closed as a point set, hence t(X Y)Cg, which proves (17). 

(16) and (17) show that > is a sphere like space. Moreover = can be ob- 
tained from = as in Theorem 1. This implies that all geodesics in 5 have 
length X*X**. We formulate our results as follows: 


THEOREM 3. Let the space satisfy Axioms A, B, C, D and let there be only 
one geodesic through two distinct points of >. 
Then > is either simply connected and all geodesics are congruent to euclidean 


straight lines. 

Or 5 has a sphere like space = as two-sheeted universal covering space, from 
which > is obtained by identifying conjugate points. The geodesics of 5 are all 
congruent to euclidean circles and have the same length. 


3. Convexity of spheres. A sphere K(A, p) is the locus of those points 
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X for which XA =p. The set v(A, p) is by definition the interior of K(A, p). 
When a is the common length of the straight lines of a closed S.L. space 
K(A, a/2) is the locus 6(A) of the conjugate points to A, and K(A, p) is 
empty for p >a/2. In the following the radius p of a sphere is always assumed 
to be not greater than a/2. 

Theorems 2 and 3 enable us to simplify and to complete the theory of 
spaces with convex spheres as developed in B, chap. 4. We shall indicate the 
new method here. 

In an open S.L. space -we would naturally say that K(A, p) is convex if 
a segment t(X Y) whose end points X, Y are in or on K(A, p) lies except for 
X, Y in the interior of K(A, p). But this definition cannot be applied to closed 
S.L. spaces, since then not even all spheres of an elliptic space would be 
convex. In euclidean geometry an equivalent definition of convexity is avail- 
able in terms of supporting planes. One is therefore led to ask whether con- 
vexity can be expressed satisfactorily in terms of the existence of supporting 
geodesics. 

We call the straight line ¢ of a sphere like space or an S.L. space a tangent 
of K(A, p) at Z if points X,# Y, on K(A, p) exist which tend to Z and such 
that the line g(X,Y,) tends to¢. We say that K(A, p) is convex if no tangent 
of K(A, p) contains points of v(A, p). The spheres of an elliptic space are 
convex in this sense. 

On the other hand one might consider dropping the distinction between 
the interior and exterior of a sphere and simply require that the spheres are 
surfaces of order 2. More exactly we say that a set wu of a sphere like space 
or of an S.L. space has order not greater than n if any straight line intersects u 
in at most different points, or else is contained in uw, and that yu has order n 
if it has order not greater than n but not order less than or equal to n—1. We 
can then show the following: 

(18) Let the sphere K(A, p) of an arbitrary S.L. space = not be the locus 
B(A) of the conjugate points to A and have order 2. Then K(A, p) is convex. 

Assume for an indirect proof that a point Z on K(A, p) and a tangent ¢ 
of K(A, p) at Z, which contains a point P of »(A, p), exist. By definition ¢ 
is limit of a sequence of straight lines g(X,Y,) where X,# Y,, X,A=Y,A=p 
and X,—Z, Y,-—Z. Choose P, on g(X,Y,) such that 

First let 2 be an open S.L. space. Then either of the subrays of 9(X,Y,) 
with origin P, contains a point of K(A, p) (because PT varies continuously 
as T traverses g(X,Y,)). The ray which contains X, will also contain Y,, at 
least for large v, because X, Y,—0. But then g(X,Y,) would intersect K(A, p) 
in at least three points. 

Next let = be closed and K(A, p)#8(A). Because of Theorem 2 the line 
g(X,Y,) contains a point Q, of 8(A). Then Q,4 >p. For the same reason as 
before either subarc of g(X,Y,) from P, to Q, contains a point of K(A, p), 
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and for large v the arc which contains X, also contains Y,, so that g(X,Y,) 
would intersect K(A, p) in at least three different points. This result will be 
completed by: 

(19) If the spheres K(A, p) of a closed S.L. space have order not greater 
than 2 then B(A) has order 1. 

We have to show that §(A) contains the line g if it contains the two 
distinct points X and Y of g. Call 5; and 6, the two subarcs of g from X to Y. 
If the theorem were not true, at least one of the arcs, 6; say, would contain 
a point P with PA=p—«, «>0. Then 6: cannot contain a point Q with 
QA =p—€s, €>0, because the sphere K(A, p—min (€:, €:)/2) would contain at 
least four points of £ without containing g. Hence K(A, a/2)=6(A) would 
have to contain 4, without containing g, and would not have order less than 
or equal to 2. 

The results (18) and (19) lead to the following theorem: 


THEOREM 4. In any S.L. space the following properties are equivalent: 

I. The spheres are convex. 

II. The spheres have order not greater than 2. 

III. A given point P has exactly one foot on a given straight line g; or else 
PX is constant for X Cg. 

IV. If the point P does not lie on the line g, then either PX =const. for X Cg 
and all lines g(PX) are perpendicular(*) to g, or else there is exactly one per- 
pendicular through P to g. 


A proof of Theorem 4 may be obtained by combining our present results 
with the results in B, chap. 4, §§1 and 3. Although many simplifications are 
now possible we refrain from giving a complete proof here in order to avoid 
duplication. 

B, chap. 4, §1, Theorem 3, states that I implies III, and B, chap. 4, §3, 
Theorem 1 contains the equivalence of II and III. Our present relations (18) 
and (19) show that II implies I. Finally B, chap. 4, §3, Theorems 4, 5 and 6 
yield together with the present Theorems 2 and 3 that IV follows from II. 
Therefore we have to show here only that IV implies III. Let P not be con- 
jugate to all points of g and not on g (otherwise III is trivial) and consider the 
perpendicular g(PF), FCg, through P to g. Since not all points of g are 
conjugate to P, F (as a foot of P) is not conjugate to P, so that g(PF) con- 
tains a point Q with (QPF). According to the definition of a perpendicular F 
is a foot of Q on g, therefore we have for every X ¥ F on g 


PX > OX —- OP = OF — OP = PF, 


which means that F is the only foot of P on g. 


@) The straight line hg is a perpendicular to g if & intersects g and all points of hk have 
this intersection as foot on g. 
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Theorems 1 and 4 yield, together with B, chap. 4, §2, Theorem 1, the 
following characterization of the elliptic spaces. 


THEOREM Sa. A closed S.L. space of dimension not less than 3, which has 
one of the properties I, II, III, or IV, is congruent to a finite dimensional elliptic 
space. 

A sphere K(A, p) of a sphere like space of dimension not less than 2 whose 
geodesics have length 2a@ is not convex as soon as p>a/2. But the point sets 
which carry K(A, p) and K(A’, a—p) coincide and one of these spheres is 
convex. Therefore we conclude from Theorems Sa and 1: 


THEOREM 5b. Let 2 be a sphere like space of dimension not less than 3, 
whose geodesics have length 2a. If the spheres of radius pSa/2 are convex, or 
af one of the properties II, III, IV holds, 2 ts congruent to a euclidean sphere. 


In B, chap. 4, §2 an example of Hamel was quoted to show that a closed 
S.L. space is not even then necessarily elliptic when its geodesics are the 
straight lines of a projective space. By means of Theorem 1 we see from 
this example that there are metrizations of a euclidean sphere (of any dimen- 
sion not less than 2) in which the geodesics are the euclidean great circles 
but the metric is not spherical. The condition of Theorem 5b may therefore 
not be omitted. 

4. Two-dimensional spaces. The characterization (5) of the straight lines 
suggests the problem to determine all spaces which satisfy Axioms A, B, C, D 
and in which the geodesics are straight lines. As solution one would conjecture 
the S.L. spaces and the sphere like spaces. The proof seems difficult for higher 
dimensions but it is easy for two. 


THEOREM 6. Let each geodesic of a two-dimensional space 2 (with Axioms 
A, .B, C,. D) contain a segment t(X Y) when it contains X and Y. Then = is 
either homeomorphic to the euclidean plane and an open S.L. space, or homeo- 
morphic to the projective plane and a closed S.L. space, or homeomorphic to a 
2-sphere and sphere like. 


Proof. If no two geodesics have more than one common point, the asser- 
tiott is contained in B, chap. 3, §2, Theorem 1. Let therefore two different 
geodesics g and /# which have the two common points A and A’ exist. It fol- 
lows from (7) that they have no further common points and that g and h are 
closed with A and A’ as conjugate points on both lines. 

Call o1, o2 and 71, 72 the two segments t(AA’) on g and h respectively. 
Because of B, chap. 1, §4, Theorem 4, a standard neighborhood »(A, p) of A is 
homeomorphic to a circular disk. Let S;, T; be the point on o; and 7; respec- 
tively with S;A =7;A =p, and call gx a geodesic which carries a segment 
t(A’X), where X is an interior point of the unique segment t(S;7;). By virtue 
of B, chap. 1, §4, Theorem 3, gx intersects t(S;:A)+t(A7;). The point A is 
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the only possible intersection because each intersection of gx with g or h other 
than A’ is conjugate to A’ on gx and g or h. 

Therefore the (unique) segment t(A X) lies on gx. The set >> .t(AX) with 
X C>-:.it(SiT;) covers a whole neighborhood of A, hence all geodesics through 
A are closed straight lines and pass through A’. The geodesic g(A X) is unique 
because X is not conjugate to A, and varies continuously with X. Hence the 
space is homeomorphic to the 2-sphere. 

Consider now any point B#A, A’. There is a geodesic k through A and B. 
Let / be any other geodesic through B. Since / crosses k at B and is a closed 
Jordan curve it follows from the topological structure of a sphere that / inter- 
sects k at some other point B’. We see as before that every geodesic through 
B’ passes also through B. Since B was arbitrary the space is sphere like. 

In an elliptic space we have complete duality between point and hyper- 
plane. The dual of a given theorem is obtained by interchanging a point A 
with the hyperplane 6(A) formed by the conjugate points to A. In a general 
closed S.L. space of dimension 23, hyperplanes are not defined and ques- 
tions of duality become void. 

But we may ask whether there are duality relations which hold in every 
closed S.L. plane. B(A) will in general not be a straight line. But in the elliptic 
case 8(A) may be characterized as the straight line which has maximal dis- 
tance from A. We shall show the following theorem: 


THEOREM 7. For every point A of a closed S.L. plane 5 there is exactly one 


straight line g(A) which has maximal distance from A. The correspondence 
A—g(A) maps 5 topologically on the set x of all geodesics in >. 


By Theorem 3 this theorem is equivalent to the following statement on a 
two-dimensional sphere like space 2. 

For a given point A in = there is exactly one straight line g(A) which has 
maximal distance from A. When A’ is conjugate to A, then g(A)=g(A’). 
The mapping (A, A’)—>g(A) of the pairs of conjugate points on the geodesics 
in 2 is topological. 

We shall prove this fact. Because of Theorem 6 the space 2 is homeo- 
morphic to the 2-sphere. Designate by 4a the common length of the geodesics 
in 2 and by d(A, g) the distance of the point A from the geodesic g. If A is 
not conjugate to X then the line g(AX) is unique for X Cg and intersects g 
also in X’. We see from ., 

(20) 2a=XA+AX'=X'A'+A'X 
that 

(21) d(A, g)=d(A’, g) 
and min (AX, AX’) Sa, hence 

(22) d(A, g) Sa. The equality sign holds if and only tf AX =a for X Cg. 

The number d(A, g) depends continuously on A and g. For a given point 
A there is a line go for which d(A, g) reaches its maximum m Sa. 
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(23) d(A, go)=m=a if and only if A has two feet on go which are con- 
jugate to each other. 

For if F and F’ are feet of A on go, we see from (20) that d(A, go) =AF 
=A F’=a. Conversely if d(A, go)=a, then AX =AX’=a for any X on go 
because of (22). 

The main point of the proof is 

(24) If go has maximal distance from A, then any open segment on go of 
length 2a contains a foot of A. 

Assume for an indirect proof that (24) is not true. Then it follows from 
(23) that go contains no two conjugate feet. Since the feet of A form: a closed 
subset of go, there is a closed segment s of length 2a on go which contains no 
foot of A, so that AX >m-+e for XCs and a suitable e>0. 

The endpoints Y, Y’ of s are conjugate. Call s’ the other segment t(Y Y’) 
on go. A foot F of A is an interior point of s’. Since A F=m <a, there are points 
Z with (A FZ). The line gz=g( YZ) is unique and passes through Y’. Call s7 
the subsegment t(YY’) of gz which contains Z and sz the other subsegment 
t(Y Y’) of gz. For Z—>F we have gz—0, sz—s, s7 —s’ and d(A, gz)—d(A, go). 
Consequently, when FZ is sufficiently small each foot Fz of A on gz belongs 
to s? so that d(A, s?/) =d(A, gz). But s’ separates A from s7/ in the domain 
containing A and bounded by s and sz. Hence d(A, gz) =d(A, s7)>d(A, s’) 
=4d(A, go), which contradicts the definition of go. 

(25) Any line h#go contains an open segment of length 2a which is free 
from feet of A. 

This statement is trivial for A Ch. Let therefore A not be on h and desig- 
nate by D and D’ the intersections of go and h, by pu the side of go which con- 
tains A, and by ¢ the segment t(DD’) on h which is not contained in yw. For 
any interior point G of ¢ the segment t(AG) intersects g at some point 
H#D, D’. Hence G is no foot of A on h, because otherwise 


d(A, h) = AG> AH 2 dA, 


which contradicts the definition of go. We see that the interior of s satisfies 
(25). 

(24) and (25) show that go is the only line which has maximal distance 
from A. We designate this line henceforth by g(A). Moreover g(A) =g(A’). 

(26) If B#A, A’ then g(A) contains an open segment of length 2a which 
is free from feet of B. 

The statement is obvious for BCg(A). If B does not lie on g(A), let it 
lie on the same side of g(A) as A. The line g(AB) contains a point H on g(A) 
with (ABZ). If H is a foot of A on g(A), then any X ¥H on g satisfies the in- 
equality 
BX > AX — AB = AG — AB = BG, 


so that G is the only foot of B on g(A) and (26) holds. If H is not a foot of A, 
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we traverse g(A) from H on, in either direction, until we meet the first foot 
of A. We may thus obtain the feet Fi, F,. Because of (24) we have Fi F:;<2a. 
To prove (26) it suffices to see that t(F,F2) contains all feet of B. If G is any 
point on g(A)—t(FiF2), the segment t(AG) intersects t(A Fi) +t(A F2) at some 
point R. Let RCt(AF,). Since Fi F:<2a the point F; is different from the con- 
jugate point H’ to H, hence we obtain, using that F; is a foot of A, 


BF, < BR+ RF; = BR+ AF; — ARS BR+ AG — ARS BR+GR = BG. 


This shows that G is not a foot of B. 

We conclude from (24) and (26) that g(A)#g(B) for A+B; therefore the 
mapping A—g(A) is one-to-one. The continuity follows immediately from the 
continuity of d(A, g) in A and g. 

It is easily seen that all geodesics of 2 form a set t homeomorphic to the 
projective plane. Let u map 7m topologically on 2. The mapping A—g(A) 
—g(A)u maps 2 on the subset 2’ of those points which correspond to geo- 
desics which have maximal distance from some point of 2. Since the mapping 
of = on 2’ is topological, we have >= Z’. Hence g(A) traverses all of 7 as A 
traverses 2. 

This theorem throws some light on the difficulties of the problem to de- 
termine the closed S.L. planes with convex circles, because it shows how weak 
an assumption the convexity of the circles is in two dimensions. 
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SPECTRAL THEORY. I 
CONVERGENCE TO PROJECTIONS 


BY 
NELSON DUNFORD 


Spectral theory has for its origin the classical canonical reduction of a ” ‘4 


matrix, and it is this classical theory that provides the pattern for the de- a 
velopments presented in this paper. The finite dimensional linear vector space 4 ; 
of the classical theory is here replaced by a complex Banach space ¥ and fi 
the matrix in that theory by a continuous linear operator T in the space %. a 


In this situation there are always three and sometimes four types of con- 
vergence, namely, the uniform, the strong, the weak, and, in case ¥ consists 
of measurable functions, convergence almost everywhere. While our primary 
aim is to give necessary and sufficient conditions for convergence (in each 
of the four types) of a sequence Q,(7) of polynomials in T to a specified 
kind of limit operator, we have, and not merely incidentally but of necessity 
as well as for its own sake, developed a parallel to many of the algebraic 
aspects of classical matrix theory. In fact the chief tools in the convergence 
theory here presented are theorems corresponding to (a) the minimal equa- 
tion theorem for matrices, (b) the theorem of Sylvester concerning the de- 
terminant of a polynomial in a matrix, and (c) the various decompositions _ 
of the whole space into a direct sum of its subspaces determined by the spec- 
trum of a matrix. This analogue to classical matrix algebra plays its role 
almost exclusively in the questions involving uniform convergence (which of 
the four types seems to be the most difficult one to handle; especially in mak- 
ing the proofs of necessity rather than those of sufficiency) and the reader 
interested only in the other three types of convergence will find the last sec- 
tion fairly complete in itself as far as these questions are concerned. 

The type of limit operator E=lim, Q,(7) with which we shall be chiefly 
concerned is a projection of the whole space ¥ on the manifold M[P] consist- 
ing of all x©X where, for a specified polynomial P, we have P(T)x=0. In 
seeking conditions for uniform convergence of a given sequence Q,(7) towards 
such a limit, one is faced with two fairly obvious necessary conditions, one 
of which is P(T)Q,(T)—0. A third condition which will make the set of 
three both necessary and sufficient may be any one of three types, namely, 
(a) of spectral type and asserting that the roots of P(A)=0 be either in the 
resolvent set of T or else poles of the resolvent, or (b) of algebraic type and 
asserting that the manifold N[P]=P(7)z is a closed complement of M[P], 
or (c) of topological type and asserting that for every root A of P(A)=0 of 
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multiplicity vy we have (AJ —T7)’*!X¥ closed. One way of visualizing the differ- 
ence between strong convergence and uniform is through the corresponding 
algebraic condition for strong convergence which is R[P]@M[P] = x. 

The ergodic theorem falls within the scope of this theory, for in the 
ergodic theorem we are dealing with the case of a projection on the fixed 
points of an operator, that is P(T)=I—T. As the general situation shows, 
the conditions for convergence of the particular sequence n—")_*~1,7™ of poly- 
nomials are, if worded properly, equally valid for any sequence of poly- 
nomials Q,(7) with Q,(1)—1, but for the particular sequence n-)>*-T™ 
the condition P(T)Q,(T) =(I—T)n-")_%-T™ = (I—T")/n-0 takes the rela- 
tively simple form of 7*/n-0. It is the condition 7*/n-+0 and not the cus- 
tomary one of | 7*| $M which is essential in the ergodic theory. 

Since an element of a normed ring (in the sense of Gelfand) when con- 
sidered as a linear operator on that ring has its norm as an operator the same 
as its norm in the ring, the purely algebraic results of §2 as well as those 
results in §3 concerning uniform convergence may be thought of as results 
concerning an element T of a normed ring. 

The indigenous connection between the spectrum of an operator T and 
questions of convergence of sequences of functions Q,(7) of T is most readily 
seen (but not completely comprehended) by an examination of the spectrum 
of a finite matrix. For this reason and also because the finite case furnishes 
some insight into the puzzles of the infinite one we shall first present certain 
aspects of classical matrix theory in a form roughly as they shall appear in 
the general case. 

1. Spectral theory in a finite dimensional space. We shall not discuss this 
case as completely as we shall the infinite one but enough will be given to 
indicate the general trend of ideas. In fact we shall merely derive a formula 
(equivalent to the classical canonical reduction) which will serve as a basis 
for an operational calcilus and show how questions of convergence may be 
settled by spectral considerations. Except for the first two definitions and 
lemmas which are not concerned with the dimension or the topology in ¥ 
(and which will be needed in the infinite case) we are concerned here with a 
linear vector space ¥ of dimension »>0 over the field of complex numbers 
(or any algebraically closed field) and a linear operator T (that is an nXn 
matrix) which maps & into all or part of itself. 


1.1. Derinition. Let P be a polynomial, \ a complex number, and v a posi- 
tive integer or 0. We define 


N[P] = P(T)E, m[P] = € =o}, 


Ni = (AT — T)’R, M = E€ [Al —7)’s=0], 


where I is the identity operator in %. 


‘ 
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1.2. DEFINITION. Jf M, Mi, ---, Ms are linear manifolds in X¥ we shall 


write M=MiG --- OM: if and only if MiCM, i=1,---, k, and every 
has a unique representation x=x1+ +x, 
We shall write - - +M if and only if MsCM, +=1, ---, k, and 
every xe M can be represented as x=x,+ +--+ +x, 


1.3. Lemma. Jf P, Q are polynomials with no common roots then 
M[P-Q] = m[P] Mil. 
Clearly M[P], M[Q}CM[P-]. Since P and Q have no common factors 
there are polynomials R, S with 
RAPA) + = 1, 
R(T) P(T) + S(T)Q(T) = 1, 
R(T) P*(T) + S(T) P(T)Q(T) = P(T). 


Thus if P(T)Q(T)x=0, that is, if x<EM[P-Q], then P(7)(x—y)=0 where 
y=R(T)P(T)xEM[P]. This decomposition is unique, for the second of the 
above identities shows that u=0 if ~EM[P]-M[]. 


1.4. Lemma. Let P(A) =] [%.,0As—A)” be @ polynomial whose distinct roots 
are dy, - ++, Ax. Let Ps(A)=P(A)/Ai—A)”*. Then 


= Mi 


and 


= N[Pi] +R[Pr). 


The first conclusion is proved by a repeated application of Lemma 1.1 
and the second by observing that since P(A), - - - , P(A) have no common 
factor there are polynomials Ri, - - - , Rx with 


Ri(A)Pi(d) + + Re(A)Pi(A) = 1, 
Ri(T)Pi(T) + R(T) PAT) = I. 


1.5. DEFINITION. The index v of a complex number d is the smallest positive 
integer or zero such that (T—dI)’t'x=0 implies (T—dI)’x =0. 


Thus the index » of X is positive if and only if \ is a characteristic number 
of the matrix 7. In what follows we use the symbols A, - - - , Ax for the dis- 
tinct characteristic numbers of T and », - - - , vz for their indices. 


1.6. Lemma. If then P(T) =0 and hence 
OMe 


To see that P(7) =0 let x be an arbitrary vector in ¥. Since the dimension 
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of is m there are constants - , with -- + +a,7T%x=0, 
that is, Q(7)x=0 for some polynomial Q. If Q is written in factored form 
then T@d —T)™‘x=0 and it is clear that this equation remains valid if all 
factors with u; not a characteristic number are removed, and from the defini- 
tion of v; we see that the other factors may be changed to have the desired 
exponents. Thus P(T)x=0, P(T)=0. The second conclusion follows from 
Lemma 1.4. 


1.7. DEFINITION. Let k, be the projections which, in view 
of Lemma 1.6, exist and satisfy I=E),+ --+ +Exy, Ex 
Ey,Ey, =0, ixj. 


Note that since CM we have =TEy,x, that 
is TE,,=Ey,T. 


1.8. DeFiniTIon. Let F(T) be the class of all complex functions of a complex 
variable which are regular at every point of an open set (not necessarily con- 
nected) containing the characteristic numbers \iy,---, Xx of T. (The open set 
upon which fEF(T) ts regular may vary with f.) For fEF(T) we define 


i=l m=0 

1.9. THEorEM. If f, g& F(T) and a, B are complex numbers then 

(a) af+BgEF(T) and (af+fg)(T) =af(T)+8g(T), 

(b) f-gEF(T) and (f-g)(T) =f(T)-2(7), 

(c) if f\) then f(T) 

Statements (a) and the first part of (b) are obvious. To prove that 
(f-g)(T) =f(T) -g(T) let us write, for hE F(T), 

Dy,(h, T) = — h\™)(d,). 
aud m! 

Then since (J for v; we have 


p I)? 


Dy(f, T)Da.(g, T)Ex, = Er, 


p=0 n=0 


(T — 


p=0 p! 


vi-l (T — 


p=0 p! 


Di (f-8, T). 


Hence 
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t=1 i=l 


k 


> Da(f-g, = (f-g)(T), 


t=1 


which proves (b). In view of (a) and (b) statement (c) need only be verified 
for the special cases of f(A) =1, f(A) =A and (c) for these polynomials is an 
immediate consequence of Definition 1.7. 


1.10. THEOREM. If f, gE F(T) then f(T) =g(T) if and only if f™(A;) =g™(A,), 
i=1,---,k;n=0,---,v;—1. Thus for a polynomial P we have P(T)=0 if 
and only if P(A) contains [[*_,(Ai—X)”* as a factor. 


The sufficiency of the condition is obvious from (*) and its necessity 
follows from the definition of v; as follows. In view of part (a) of Theorem 1.9 
we may and shall assume that f(7)=0. Let x¥0, (T—A,J)x=0 so that by 
(*) and 1.7 we have 0=f(T)x=f(A,)x, f(x) =0. Next, in case »;>1, pick 
x so that (T—A,J)*x=0, (T—A,J)x¥0 then as before 0=f(T)x=f(A,)x 
+(T—A Df Aix =f Ax) (T—AD)x, and so f(A;) =0. A repetition of this 
argument clearly yields f‘(A;)=0, 1,---, »;—1. 


1.11. Let f,€ F(T), then f,(T) converges if and only if 
converges for eachi=1,---,k,andj=0,---,v;—1. 


The sufficiency of the conditions is clear from (*) while their necessity 
follows from the argument used in the proof of the preceding theorem. 
The next theorem is proved in the same way. 


1.12. THEorem. Let f, F(T) then f,(T)—f(T) if and only if lim 


1.13. An illustration. If f,(A) =n-1)_%-\A™ we have fa(A) converging if and 
only if |\| $1, while f(A) converges for all j=1, 2, - - - providing |A| <1 
but the first derivative f(A) fails to converge if |A| =1. Hence (1.11) shows 
that n-!)"*-4,7™ converges if and only if all roots \; of T have |\,;| $1 and 
if in addition »;=+1 for every root 4; with |\,;| =1. Next consider the state- 
ment 7*/n— 0. An application of (1.12) to the functions f,(A) =A"/n shows 
us that 7*/n-0 if and only if the roots \; of T satisfy the same conditions 
required in the preceding example. Hence n—")("-),7™ converges if and only if 
T*/n—0. It is not pure coincidence that these two statements concerning con- 
vergence have precisely the same spectral interpretation. It is a consequence 


of the fact that the statement 7"*/n-0 is equivalent to (I—T)n =47™ 
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=(I—T™*)/n->0 and is a special case of a general principle which will be 
developed more completely in §3. 


1.14. Concluding remarks. The formula 


(i) => pmagm, 


i=l m=0 


of 1.8 becomes 


(ii) KT) = En, 

for a matrix T whose characteristic numbers ), all have index 1. In particular 
(ii) holds if T is an Hermitian matrix. This case has been generalized in an 
extensive literature to the concept of a self-adjoint operator in Hilbert space 
where (ii) takes the well known form of f(T) = /t2f(A)dEy, and in recent years 
has been given, by M. H. Stone(*) and I. Gelfand(?), an algebraic topological 
form. One difference between the ring of polynomials in a general matrix T 
and the ring of polynomials in an Hermitian matrix T is that (as formulas 
(i) and (ii) show) the former ring is isomorphic with a direct sum of reduced 
polynomial rings while the latter is isomorphic with a direct sum of identical 
fields, that is, isomorphic with a linear space of scalar functions. 

If the projections £,, in (i) are eliminated from (i) by using the Hermite 

interpolation formula then (i) takes the very natural form 


1 fA) 

dd, 

(itl) f(T) 2ridJc AI —T 
where C is the boundary of a set of small circles containing Au, - - - , Ax. If C; 
is a small circle about \,; then (iii) gives 
(iv) 1 dy 

M—T 

Formula (iii) was suggested by E. Cartan(*) as a possible basis for an opera- 
tional calculus for infinite matrices and was used (only in an incidental way) 
by Gelfand. Formulas (iii) and (iv) were discovered by Fantappié(‘) who 
was working with finite matrices, and recently (iv) has been extended to the 
infinite case by Lorch(), who proved that an operator E defined by the equa- 


(*) M. H. Stone, A general theory of spectra. I, Proc. Nat. Acad. Sci. U. S. A. vol. 26 (1940) 


pp. 280-283. 
(?) I. Gelfand, Normierte Ringe, Rec. Math. (Mat. Sbornik) N. S. vol. 51 (1941) pp. 3-24. 
(*) The suggestion was made in a letter to G. Giorgi. For this reference as well as for others 
per.aining to an operational calculus see C. C. MacDuffee, The theory of matrices, Berlin, 1933. 
(*) L. Fantappié, La calcul des matrices, C. R. Acad. Sci. Paris vol. 186 (1928) pp. 619-621. 
(5) E, R. Lorch, The spectrum of linear transformations, Trans. Amer. Math. Soc. vol. 52 


(1942) pp. 238-248. 
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tion E=(2ri)-'fc(AI —T)—dd is a projection, and that the algebra of such 
projections is isomorphic with the Boolean algebra of all open and closed sub- 
sets of the spectrum of T. 

2. An operational calculus and some algebraic properties of linear opera- 
tors. The procedure of §2 used to di$cuss the general linear operator T differs 
only slightly from that of §1. We first define the class 7(7) and the operator 
f(T) for fEF(T) and then define the projections corresponding to the Ey, 
of §1. The fundamental algebraic theorems (Theorems 2.12 through 2.29) 
may then be formulated in terms of these concepts. 

2.1. Preliminary concepts and definitions. From now on the symbol T will 
stand for a bounded linear operator on a complex Banach space %. By the 
resolvent set p(T) will be meant the set of all complex numbers A such that 
(AI—T)-! exists as an everywhere defined (and hence continuous) linear 
operator. For \Gp(T) we shall write R(T) for (AJ—T)-. The set p(T) is 
an open set containing all \ with |A| >|7|. By the spectrum o(T) of T is 
meant all complex numbers \€9(T). The spectrum o(7) is a closed point set 
contained in the circle |A| <| 7'|. If the space ¥ contains more than one point 
(and this we assume) the spectrum (7) is not empty (see Taylor(*)) and we 
write |o(T)| for l.u.b. |r| where A varies over o(T). The point spectrum o,(T) 
consists of all \ such that (AJ—T)-" does not exist, the continuous spectrum 
o.(T) of those \ for which (AJ—T)~ exists as an unbounded operator with 
domain dense in %, and the residual spectrum o,(T) of those X for which 
(AI—T)-! exists with domain nondense in %. It is readily shown that if 
(AI — T)-! exists as a bounded operator with domain dense in ¥ then ACE p(7), 
so that the point sets p(T), ¢,(T), ¢.(T), o,(T) are disjoint and their union 
is the whole complex plane. The function R,(T) satisfies the resolvent equation 
R(T) —R,(T) R(T) RAT), 4, and is regular, that is, 
dR,(T)/dd exists, at every point \€ p(T). If f(A) is any vector (or operator) 
valued function, regular and single-valued at every point of a closed domain 
D whose boundary C is a finite number of rectifiable Jordan curves then 
(1) (2) fA) =(2mi)— for in the interior of D, 
(3) f(A) has derivatives of all orders at every point of D, (4) f‘() 

= (24i)—'n! fof(~)(E—A)—-*-dé, for d in the interior of D, (5) the Taylor ex- 

sion > a-o(A—£)*f™(é)/n! converges uniformly to f(A) for in any circle 
ner | Sr contained in the interior of D. If f(A) is regular and single-valued 
at each point of an annulus enclosed by two concentric circles C; and C; with 
center \) and if C is any circle between C; and Cz, then for every X in this 
annulus 


1 d)dr 
SA) a,(A Xo)”, a, = 


n=—o 


n=0,+1,+2,---. 


(® A. E. Taylor, The resolvent of a closed transformation, Bull. Amer. Math. Soc. vol. 44 
(1938) pp. 70-74. 
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The above statements concerning regular functions may be proved as fol- 
lows. For a linear functional over %, that is a point x*€%*, we have 
x* = fcx*f(A)dd and thus by Cauchy’s theorem =0. Hence 
by the Hahn-Banach extension theorem (which was proved for complex linear 
functionals over a complex Banach space by Bohnenblust and Sobczyk(’)) 
we have {cf(A)dA=0. This proves (1) and (2) is proved in the same way while 
the rest of the statements follow in the usual manner. 

Many standard theorems in the theory of linear operators on a Banach 
space are readily seen to hold in a complex Banach space and these will be 
used when needed. In particular the various corollaries of the Hahn-Banach 
extension theorem including the principle of uniform boundedness will be used. 
Also the interior mapping theorem or, more specifically, its corollary which 
asserts that a closed everywhere defined linear operator is continuous will be 
used. This shows that the projections E, E’ = J—E determined by closed com- 
plementary linear manifolds in ¥ are continuous. 


AN OPERATIONAL CALCULUS 


2.2. DEFINITION. By a T-admissible domain is meant an open set D of com- 
plex numbers having the following properties : 

(1) Dis a finite sum of connec'zd open sets D; with D;D;=0, i¥j. 

(2) The boundary C of D consists of a finite number of disjoint closed recti- 
fiable Jordan curves contained in the resolvent set p(T) of T. We shall sometimes 
write D=D(T) to mean that D is a T-admissible domain. 


2.3. DeriniTion. The class F(T) is the class of all complex functions each 
of which is regular and single-valued at each point of the closure of some D=D(T) 
which contains the spectrum a(T) of T. 


2.4. Lemma. Let D, D* be T-admissible domains with o(T)CD*CD*CD. 
Let @(d) be regular and single-valued on D. Then 


f smacna= f sora, 
c c 


where C, C* are the boundaries of D, D* respectively. 


The domain D is made up of a finite sum of disjoint sets and taking the 
integral around C is the same as taking the integral around each connected 
component and adding. Thus, without loss of generality, we may and shall 
confine our attention to the case where D is connected. Suppose first that D 
is simply connected and that D* is connected. Since ¢(T)CD* none of the 
inside boundaries of D* can surround a part of o(T) and hence the integral 
around every inside boundary of D* vanishes. Thus the integral {¢* reduces 


(7) H. F. Bohnenblust and A. Sobcezyk, Extensions of functionals on complex linear spaces, 
Bull. Amer. Math. Soc. vol. 44 (1938) pp. 91-93. 
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to the integral around the outside boundary of D* which may be joined by 
a cut to the boundary of D to prove the lemma in this case. Next suppose 
only that D is simply connected. This case reduces to the first case by making 
cuts joining up the various parts of D*. Now suppose the lemma has been 
proved for the case where D is (m—1)-tuply connected. We now take D to 
be n-tuply connected. If at least one of the inside boundaries of D lies out- 
side of all the outside boundaries of the various parts of D* it may be joined 
by a cut to the outside boundary of D, thus reducing D to a (n—1)-tuply 
connected domain. If every inside boundary C; of D lies in one of the holes 
of some part of D* and is thus surrounded by some uniquely determined 
nearest part C* of the boundary C* then fc,=fc*. Also any inside boundary 
C# of D* which does not surround such a C; nor any portion of D* has the prop- 
erty that fct=0 and this type of C* may be dropped. These observations 
reduce the problem to the case where (i) every inside boundary C; of C is 
contained in at least one of the holes in some part of D* and (ii) every inside 
boundary of D* surrounds either (a) an inside boundary of D or (8) one or 
more parts of D*. Here parts (a) and (8) are not mutually exclusive. Now 
consider an inside boundary C¥ of D* which comes under (ii)(8). The various 
portions of D* within C* may be joined by cuts and one of them to C¥#. 
This process reduces the problem to the case where every inside C; is in one 
and only one inside C* and every inside C* contains one C;. Now it is clear 
that the integral around the remaining (uncut) outside boundaries of D* is 
equal to the integral around the outside boundary of D and the integral 
around such inside boundary of D* is equal to the integral around the corre- 
sponding boundary of D and thus fc=/c*. 


2.5. Lemma. Let D, D* be T-admissible domains with o(T)CDD*. Then 
there is a T-admissible domain D, with o(T)CDiCDD*. If is regular and 
single-valued at each point of D and at each point of D* then 


f f 
Cc Cc 


where C, C* are the boundaries of D, D* respectively. 


The first part of this lemma follows from the Heine-Borel Theorem and 
the second part then follows from the preceding lemma. 


2.6. Derinition. Let fE F(T), D=D(T), DDa(T) and let f be regular and 
single-valued on D. Let C be the boundary of D. We define 


The preceding lemma shows that f(T) is independent of D. 
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2.7. THEOREM. f, F(T) then 

(1) f+ gE F(T) and f(T)+2(T) =(f+2)(T), 

(2) f-gEF(T) and f(T)-g(T) =(f-g)(T). 

Statement (1) as well as the first part of (2) is obvious. To see that 
f(T) -g(T) =(f-g)(T) let C be the boundary of a domain D=D(T) such that 
o(T)CD and such that f and g are both regular and single-valued on D. 
Let C; be the boundaries of the component parts of D. Then 


C; j=1 Cj 


n=1 


Now 


R(T) — R,(T 
Cn Cu A- 


a — R,(T) J) 


A= c, 


Hence if n#m and uEC,, we have fc, f(A)(A—z) =0 and 


A)R 
f 
Cm 


c, 


-f 


and thus (*) becomes 


= f dd f 


n=1 


(**) 
R(T) — R,(7) 
--2 f f 


Now using each point of C, as a center construct a circle of radius so small , 
that f, g are both regular in the closed circle and also such that the radius is 
less than half the distance from the point on C, to the remaining contours 
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bounding D. A finite number of these circles cover the boundary C, of the 
component D, of D. Let €, be the sum of the interiors of this finite number 
of circles and let D,*=€,+D,. Let C,* be the boundary of D,*. Clearly we 
have o(7)D,CD,*, D, CD,*. Then 


RT) RAT) 


f ar 


Cn 


A— 
— 2wif(u)R,(T) 


and (**) may be written as 


k 
4x*f(T)-¢(T) = f flu)e(u)Ry(T)dy 


For C,* we have =0 and so 
1 k 
K(T)-g(T) = R(T) du 
Cy 


1 


2.8. THEOREM. Jf f(A) is a polynomial in d or, more generally, if f(d) 
is @ series convergent in a circle <p with p>|T| then 
F(T) = 


Let C be the circle |A| =p so that 


1 1 


fQ) = 
d= aT. 


n=0 


2.9. THEOREM. If fE F(T) then f(o(T)) =e(f(T)). 


Let A\Go(T) and let g(£) be defined for ~ in the domain of definition of f 
by the formula g() = (f(A) —f(€))(A—£)— so that gE F(T) and, by 2.7 and 
2.8, we have f(A) I —f(T) = (AI—T)g(T) which shows that f(A) Go(f(T)). Con- 
versely let wGo(f(T)) and suppose that wE¢f(o(T)). "Then the function 
=(f(€)—u)-* belongs to F(T) and, by 2.7 and 2.8, =I, 
which contradicts the fact that wEo(f(T)). 
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2.10. Let fEF(T), gEF(f(T)) and let F(E)=g(f(é)). Then 
FEF(T) and F(T) =g(f(7)). 

Suppose g is regular and single-valued on D* where o(f(T)) =f(o(T)) CD* 
and let C* be the boundary of D*. Pick D=D(T) such that o(7)CD, f is 
regular and single-valued on D and f(D) CD*. Then if C is the boundary of D 


we have from 2.7 
R(f(T)) = : Re(7) dé AEC 


f 
= J 8 a(f 


(LY wena 


1 


The elementary algebraic rules of operation given by 2.7, - - - , 2.10 in- 
clusive will be used from this point on without explicit reference to the theo- 
rem in question. 


2.11. DeFIniTion. By a spectral set of T will be meant any subset o of o(T) 
which is both open and closed in o(T). If o is a spectral set of T the symbol co’ 
will be used for the complement of o in o(T) so that a’ is also a spectral set of T. 


It is readily seen that a set o of complex numbers is a spectral set of 
T if and only if it is of the form ¢=a(T)D where D=D(T), and that for any - 
spectral set o of T there are T-admissible domains D, D’ with 


o = o( T)D, o’ = o( T)D’, DD’ = 0. 


Such domains are said to be complementary domains and D’ will be called a 
complement of D. If C, C’ are the boundaries of D, D’ respectively and we 
define 


1 1 
E,[T] = E,:[T] = 


then it is seen from 2.7 and 2.8 that E,[T], E,-[T] are projections reducing T 


and that 
£.[T]-£.-[T] = 0. 


With every spectral set o of T is associated the closed linear manifold 
¥.[7]C% and defined as ¥,[7]=E,[T]%. Thus if o1, - --, o% are disjoint 
spectral sets of T and c=0,+ - - - +o, we have 


| 
and 
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E,[T] = E.,(T] + + E.(T], E.,(T]-E.;[T] 0, i# 
X.[T] = X(T]. 


When there is no chance of confusion we shall sometimes write E, and X, in 
place of E,[T] and %,[T] respectively. If the spectral set o reduces to a single 
point A we shall write Ey, %, for E,, ¥, respectively. 


2.12. THEOREM. Let o be a spectral set of T. Then when T is considered as 
an operator in the space %, it has o for its spectrum. Furthermore for \Go any 
one of the following statements is true if and only if it is valid when T is re- 
garded as an operator in ¥,: d is in the point spectrum of T, d is in the residual 
spectrum of T, X is in the continuous spectrum of T, d is a pole of order v for 
R,(T), is an essential singularity for R;(T), d is an isolated point of the spec- 
trum. 


Let Ao. We shall first show that (AJ—T) is a 1-1 bicontinuous map 
of %, into all of itself. Let D=D(T) be such that s=o(7)D, AXED, and let D’ 
be a complement to D. Define fE F(T) so that f(£) = (A—£)— for — in a neigh- 
borhood of D and f(£)=0 for ~ in a neighborhood of D’. Then 


Ql f(T)E. = f(T). 


Hence if x€%¥, we have y=f(T)xEX, and (AI—T)y=x, and if AI—T)u=x 
with we have so that (AI—T) 
is a 1-1 map of %, into all of itself. Thus when T is considered as an 
operator in %, its spectrum is contained in ¢. Now if A\Go then AGo’ 
and by what has just been proved we have (AJ—T) a 1-1 map of &,, into all 
of itself. If (AJ —T) were also a 1-1 map of &, into all of itself we see from the 
equation =%,@X,- that (AJ—T) would be a 1-1 map of & into all of itself, 
which is impossible since \Go. Thus when T is considered as an operator in 
%, it has ¢ for its spectrum. Next let A\Go be in the point spectrum of T. 
Let x #0 be such that (AJ—T)x=0 so that AI—T)E.x=0, AI-—T)E.x=0. 
Since (AJ—T) is 1-1 on X,- we have E,,.x=0, and hence x =£,«x and d is in 
the point spectrum of T when T is considered as operating in ¥,. The con- 
verse is obvious. Now for any A\Go we have 


and so (AJ—T) is dense in but not equal to & if and only if (AJ —T7)&, is dense 
in but not equal to ¥,. Thus for \Go either of the following statements is true 
if and only if it is true when T is regarded as operating in ¥,: \ is in the con- 
tinuous spectrum of 7, is in the residual spectrum of T. Finally let \ be an 
isolated point in ¢. The Laurent expansion of R;(T) about the point A is 

1 R,(T) dé 


R(T) = = — 


} 
(AI — T)¥ = (AI — (AT — = (Al — 
i 
| 
| 
( ail 
4 
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where C is a small circle about A. Thus 


1 
Cc 


and hence J is a pole of order v if and only if 
QI-T)K=0, #0. 


Thus A will be a pole of order v when T is regarded as operating in %, if and 
only if 
(AI — T)’E,E, = 0, (AI — T)""EB,E, # 0, 

but since \Ge we have E,E,=E) and hence the two conditions are equiva- 
lent. This completes the proof of the theorem. 

The following corollaries will be used frequently. 

2.13. THEOREM. If o is a spectral set of T and Xo then (AI—T) is a 1-1 
bicontinuous map of %, into all of itself. 

2.14. THEoreM. If o is a spectral set of T then E,[T]=TI if and only if 
o=o(T), and E,[T]=0 if and only if o is empty. 

This is an immediate consequence of 2.13. , 


2.15. THEOREM. If X is an isolated point in the spectrum of T then d is a 

pole of order v if and only if 
Ql-TYE.=0, <0. 

This was established in the proof of 2.12. 

2.16. DEFINITION. As in the finite case we say that a complex number d has 
index v (a positive integer or zero) in case (AI —T)’*+'x =0 implies (AI —T)"x =0 
and there is an with (AI—T)’x =0, (AT 0. If no such integer v 
exists we say that d ts of infinite index. For any complex number d and inleger 
n= 0 we define 


Mi = E [Aar—T)*x= 0]. 
xEX 

2.17. THEOREM. Jf d is a pole of order v for R;(T) then dX has index v and 
X,= My, =Nx, where is the spectral set of T complementary tor. Further- 
more =Nx for n=v while for 1S Nz is a proper subset of Ni’. 

By 2.15 we have 
(AI — T)’E, = 0, (AI — 0. 
Thus there is an x with AJ—T7)’E,x =0, (AT —T)’"Ex 0, and so the index 


of d is at least y. Now suppose that for some m and x we have (AJ —T)*x=0. 
Then since 
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=, (Al — T)* 


we see that the function 


n—1 — i 
> 


j=0 


is regular over the entire plane except perhaps at the point =X. Thus if K 
is any contour surrounding o(7) and C is a small circle about A we have 


(**) f = — f R(T) xdt = Exx, 


and hence (AJ —T)’x = (AI —T)’E,x=0, which proves that \ has index at 
most v. Thus J has index v. Equation (*) shows that ¥,C DX and (**), which 
holds for shows that Thus Pi =X,. Now 


Ok =M kr, 
and by 2.13 
= (A — C (AT — =M 


so that Since has index v we have and hence 
X=DM Thus and MN are both complements to MK and which 
proves that =%.. By 2.13 = (AI —T)*N = Al —T)*¥, =N and 
so N*=NZ for Sv. Since if, for some k, we have for all 
n 2k, to prove the final assertion of the theorem it will suffice to show 
that NX is a proper subset of Mi~'. Since \ has index vy there is an x with 
AI =0, 0 Since 0X and =0 we 
have yEN~'—Nx which completes the proof of the theorem. 

There is a converse to Theorem 2.17. It states that if for some \ and m 
we have Ny @ My =* and NF closed, then d is either in the resolvent set p(T) 
or else a pole of order at most m for R;(T). This result is a special case of 
2.23 which will be proved later. 


2.18. THEOREM. Let \ be either in p(T) or an isolated point of the spectrum. 
Let & be the distance from > to the rest of the spectrum. Then the following siate- 
ments are equivalent : 

(1) xE%,. 

(2) lim, (AJ —T)*e-*x =0 for every e>0. 

(3) For some positive ¢< 6 the equation (2) is satisfied. 


To see that (1) implies (2) let C be the circle |\A—| =p where 0<p<e. 
Then 
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(AI — T)"e-"E, = 0 


Conversely suppose (2) is satisfied for some positive e< 5. Let 0<61.<€<@<é;, 
and C; (¢=1, 2, 3), the circle |A—€| =€;, be chosen so that the spectrum 
o(T) with the exception of the point =d is in the open annulus bounded by 
and C;. Then the function f,(€) = [1 —(A—£)*e-*]— is in ¥(T) and 


1 
C1 Cy C3 


fn(T) 


Since we are assuming (2) to hold for € we have f-1(T)x =x —(AI —T)"e-*"x-x 
and thus for arbitrary p>0, |x — <p, lfa(T)x —2| <plf.(T)| for 
all large n. Since f,(7)—>E, we have | Eyx —x| <p|£,| and since p is arbi- 
trary, =x. 


2.19. THEOREM (THE MINIMAL EQUATION THEOREM). Jf fE F(T) then 
S(T) =0 tf and only if 
(1) For every pole X of R:(T) of order v 


fr) = 0, 
(2) f(A) =0 in a neighborhood of the spectrum a(T) excluding poles of R;(T). 


Suppose (1) and (2) are satisfied. Let D=D,+ ---+D, be a domain 
upon which f is regular and single-valued and with D)o(T). Clearly, in view 
of (2), 


(*) Mm=—f 


where C* is the boundary of those D; containing no singularities of R;(T) ex- 
cept a finite number of poles. Let Ax, - - - , Ax be the poles contained within C* 
and let 1, - - - , », be their orders. Then from (1) we have f(A) = (A; —A)”#g:(A) 
where g; is regular at A; and g,;GF(T). Hence by 2.15, f(T)E,, 
=g:(T) (AJ —T)"*E,,=0 and hence from (*) we have 


k 
KT) = = 0. 
Now conversely suppose f(7)=0. Then 0=o(f(T))=f(o(T)) and so f(A) is 
identically zero in any connected domain (upon which it is regular) contain- 
ing infinitely many points of o(7). Hence equation (*) holds where C* is 
now the boundary of a finite number of domains containing only a finite 
number of singularities of R;(7). Let Au, - - - , An be the poles within C* and 
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"1, their orders. Let , Am be the essential singularities within 
C*. Now suppose that for some 7 Sm” we have 


S(O) 0, 0, j 0, 1, 


where k<»;. Define a function g€7(T) so that in a neighborhood of A; we 
have f(A) =(A;—A)*g(A) while g(A)=1 in a neighborhood of the spectrum 
o(T) excluding \;. Thus g, g~' are both in 7(T). Since k<»; we have by 2.17 
a point xX), with (A,J —T)*x+0, and thus since g(7) has an inverse 


I(T)x = f(T) = g(T)(I — 0, 


which is a contradiction. 

The coefficient a_,-1 of (A;—£)~*~' in the Laurent expansion of R;(T) 
about the point =), is (as was shown in the proof of 2.12) —(A,J—T)*Ey, 
and thus for i>mn we have (A,J—T7)*E,,#0 for all m=1, 2,---. Just as 
before suppose that for some i>m and some k21 we have f )(A;)+0, 
=0,7=0, - - - , andas before define gE F(T) so that g-'(T) exists 
and f(T)E,,= A.J —T)*g(T)E,,#0, which again is a contradiction. 

An immediate corollary is 


2.20. THEoremM. If f, gE F(T) then f(T) =g(T) if and only if 
(1) For every pole X of R:(T) of order v 


SPO) = gM), 1. 


(2) f(A) =g() for every \ in a neighborhood of the spectrum a(T) excluding 
poles. 


2.21. THEOREM. Let - , Ax be poles of R;(T) of orders vy, ++ + , ve re- 
spectively. Let be the complement of the spectral set ¢= (1, - , Xx). Then for 
every fEF(T) we have 


vi-1 T 


This is an immediate corollary of 2.20. 
2.22. DeFiniTIon. Let fE F(T), then as in 1.1 we define the manifolds 


2.23. THEOREM. Let fEF(T) and suppose that FX) is not identically zero 
on any of the domains in which it is regular. Let di, - « « , Xx be the roots of f(d) 
and m, ---, m, their multiplicities. Then the following statements are equiva- 
lent: 

(1) MIfl =X, ts closed. 


i 
| 
Bi 
ij 
i 
| 


202 NELSON DUNFORD - [September 


(2) Fori=1,---, k, X; is either in the resolvent set p(T) or else a pole of 
order v; sm; of R:(T). 
(3) The finite set o=(Ax, - - - , Ax)o(T) is spectral set of T and 


Assuming (1) we have continuous projections E, E’ with EX=M[f], 
E'%=N[f], EE’=0, I=E+E’, TE=ET. Also for any \ we have (AJ—T) 
a 1-1 map of & into all of itself if and only if (AJ—T) is a 1-1 map of EX into 
all of itself and of E’% into all of itself. Thus the spectrum o(T) of T is o1+0/ 
where o:, o/ are the spectrums of T when it is considered as an operator in EX, 
E'% respectively. In EX we have f(T) =0 and so @; is contained in the point 
set (Ai, - - +, Ax). Since f(7)E=0 we have f(7)=f(T)E’ and hence 


= = = f(T)R[F] C = RU). 


Therefore, since M[f]-N[f]=0, f(T) is a 1-1 map of N[f]=E’ into all of 
itself. Hence when regarded as an operator in E’¥, 0€p(f(7)) and hence 
\sEp(T), i=1,---,k. Thus of is a closed point set not containing 
o=0(T)(Ai, - - - , Ax). This shows that ¢ as well asd; is a spectral set of T. 
Now let x=Ex+E’xEX,,, then 2.18 gives 

lim (A.J — T)*e-*E’x = 0, e>0, 
and hence since (A;,J—T) is a 1-1 bicontinuous map of E’% into all of itself 
we have E’x=0 and x=ExEM|[f]. Thus 


O--- OU, CML]. 
On the other hand if «© Dif we have (by 2.18) x€%,, and so by 1.4 


M[P] = Mr CX, 


where P(A) =] [%_,(A:—A)™. If A(A) is defined on the same domain as f(A) 
by the equation f(A) =P(A)-h(A) we see that h(A) has no roots in ¢(T) and 
hence h-1(T) exists, which shows that M[P]=M[f] and thus the above equa- 
tions give M[f]=%.. Now since f(A)#0 and in ¥,=M[f] we have f(T) =0 
we see from the minimal equation theorem that in X, every spectral point ); 
must be a pole of order »;Sm;. Thus (2) follows from 2.12. It has therefore 
been proved that (1) implies (2). During the course of this proof all but the 
last part of (3) has also been established. To see that ¥,,=%[f] note first 
that since 0€ f(o’) we have (by 2.12) 


Xe = f(T). C f(T)¥ = 
Next 


= 
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and so %[f] and %, are both complements of M[f] and ¥.,CR[f]. This 
clearly implies that ¥,,=[f]. Thus (1) implies (3). Clearly (3) implies (1). 
Now assume (2) and let v;=0 if \; is in p(T), then if P(A) is the polynomial 
defined above we have by 2.17 and 1.4 


OMI 
@--- OM, = = 
Also by 2.17 and 1.4 we have, for every x such that P?(T)x =0, 
OM =MS--- OME = 
and so P(T)x=0, that is N[P]-M[P]=0. Thus since (using 2.12) 
N[f] = D P(T)z. = 


we have 


and ¥,,=[f] is closed. This completes the proof of 2.23. 

The formula o(f(T))=f(¢(T)) may be given a much sharper form. To 
see this let us first examine the finite case of an Xm matrix T whose 
distinct characteristic numbers are );, ---, Ay. The formula asserts that 
f:), - - >, f(x) are the characteristic numbers of f(T). However a theorem 


due to Sylvester asserts more; namely, if Au, - - - , A, are the characteristic 
numbers of T, each repeated according to its multiplicity, then f(A1), - - -,f(n) 
are the characteristic numbers of f(T), where the number of repetitions of a 
given number in this array is its multiplicity. In the language of determi- 
nants this theorem of Sylvester asserts that if Ay, - --, A, are the distinct 
characteristic numbers of T and if m; is the multiplicity of A; then 
Det(f(T) —AJ) =[ (fAx) —A)™. Recalling that the multiplicity T) 
of a root A; of T of index v; is the number of linearly independent solutions 
of =0, that is T) is the dimension of Dy = Dyk, it is seen 
that the Sylvester theorem, that is the formula 

(i) m(u, f(T)) = >) m(a, T) 

' (where m(u, f(T)) is the multiplicity of u as a root of f(T) and the sum on 
the right is taken over all \ such that f(A) =) is an immediate consequence of 


*w) 
(where the sum is a vector sum). Formula (ii) which states that two mani- 


folds are equal is for many purposes more desirable than (i) which merely 
states that the dimensions of these manifolds are the same. On the other 
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hand (ii) is readily derived from (i) as follows. Suppose (AJ—T7)’x=0 
(that is x©%X,(T)), where f(A)=yu, then 
=(AI—T)’g’(T)x=0 and so x€%,[f(T)]. Thus the left side of (ii) con- 
tains the right but by (i) the dimension of the left side is the same as the 
dimension of the right side and so the manifolds are the same. A form of 
this theorem which is more desirable than (ii) is 
(iii) BIT), 

which states the equality of two projections rather than (as (ii) does) the 
equality of the manifolds upon which they project. 

In Theorems 2.24 and 2.29 to follow we shall give two forms of (iii) ap- 
plicable to a general linear operator T. 


2.24. THEOREM. Let fE F(T) and let r be a spectral set of f(T). Then 
o=a(T)-f-'(r) is a spectral set of T and 


E,[f(T)] = £,[T]. 


Let r’ be the complement of 7 in o(f(7)). Then since o(f(T)) =f(¢(T)) 
we have o+o’=0(7) where o’=a(T)-f~'(r). Since 7, r’ are closed and dis- 
joint and f(A) is continuous we see that o, o’ are closed and disjoint and 
thus they are complementary spectral sets of T. Let D, D’ be f(T)-admissible 
domains with DD’ r’=o(f(T))D’. Then there are T-ad- 
missible domains D;, Di with =0, ¢=0(T)Di, o’=0(T)Di , f is regular 
on D,+Dj and CD, f(D!) CD’. Let g, be two characteristic functions 
with gE F(f(T)), AE F(T) and such that 


ED, 1,g€D, 
and let F(é)=g(f(é)). By 2.10 we have F(T)=g(f(T)). Since F(£)=A(E) for 
£€D,+D, we have F(T)=A(T) and 
E,[f(T)] = g(f(T)) = F(T) = = £.[T]. 


2.25. LemMA. Let IT be a set of complex numbers, and for each ECT let 
B(é), An(E), m=1, 2, - + - , be bounded linear operators which have inverses. Let 
M, Mi, M2, +--+ be constants such that 


|A Ol rer. 


Then tf lim, A»(£)= B(£) uniformly for EET we have also lim, A = B-(&) 
uniformly for EET. 


The proof is the same as that given by Gelfand(*) for a special case. 


(®) Loc. cit. 
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We shall prove the lemma first for the case B(£)=J. In this case there is an 
integer mo such that <1/2 for and n2m. Thus for n 2m 


s2, eer. 


p=0 


Hence l.u.b.g ace | =K< and 
which proves the special case of B(£)=J. Now in the general case we have 


| — 1| =| (4,8) — < BO], 


and so by the first case we have 
lim B(é)A, =I 


uniformly on I’. Thus 


which proves the lemma. 


2.26. Lemma. Let B, An, n=1, 2, , be bounded linear seneners and 
let T be a closed set of complex numbers ‘with o(B)p(A 1, 2, . Then 
if A,—B we have R;(A,)—R,(B) uniformly on T. 


Let B(é)= EI —B, A,(£)= £I—A,. Since is closed and in p(B) it must 
be at positive distance from o(B). Since | R,(B)| is continuous on I and 
| Re(B)| —0 as we see that l.u.b.cr | B-1()| < ©, similarly for 
The lemma thus follows from the preceding one. 

2.27. DEFINITION. By R(T) we shall mean the ring of all operators of the 
form f(T) where fE F(T), and by R(T) the ring of all operators which are limits 
(in the uniform topology of operators) of elements in R(T). 


2.28. Lemma. With every UER(T) is associated a uniquely defined scalar 
function f called-the spectral function of U, such that: 

(i) The domain of definition of f is o(T) and f(A) is continuous for \Go(T). 

(ii) If F(T) and f,(T)U then f(A) uniformly for \Eo(T). 

(iii) f(o(T)) Co(U). 

If f.€ F(T) and f,(7)—>U then for every \€o(T) we have fu) 
€e(f.(T)—fa(T)) and so 


Lu.b. | fa(\) — S| fa(T) — fu(T) | > 0. 
AGo(T) 


Thus f(A) = lim, f,(A) exists on ¢(7) and is continuous there. If also g,(7)-~U 
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then by the same argument |f,(A)—ga(A)|—20 uniformly for \Go(T), and 
so f(A) is independent of the particular sequence f,(7)ER(T) used to ap- 
proach U. To prove (iii) let \Go(T), so that f,(A)€e(f,(7)), and suppose 
that f(A) Eo(U). Since and [f)I—U]" exists we 
have, for all large n, [f.(A)—fn(T)]-' existing, which contradicts the fact 
that f,.(A) Go(f,(T)). 


We are now prepared to give the general form of the Sylvester theorem. 


2.29. THEOREM. Let f be the spectral function of an operator UER(T) and 
let r be a spectral set of U. Then o=f-*(r) is a spectral set of T and 


E,[T] = E,[U]. 


Let r’ be the spectral set of U complementary to 7 and let o’=f-'(r’). 
By the preceding lemma f(¢(7))Cr+r’ and so ¢+0’=0(T), oo’=0. Since 
t, T’ are both closed in o(U) and f is continuous we see that ¢, o’ are both 
closed and hence are complementary spectral sets of T. Let D, D’ be U-ad- 
missible domains with r=D-o(U), r’=D’-o(U), DD’=0 and let C be the 
boundary of D. Let f,.G F(T) and f,(7)—U. Since f,(A)—f(A) uniformly on 
o(T) and f(o(T)) Cr+r’ we have for all large n 


(*) filo) CD, falo’) CD’. 


Thus f,(¢)=f,(¢)D is a spectral set of f,(7) and o(7)f='(f.(¢)) =¢, and so, 
by 2.24, E.[T]=Ey,:«) [fn(T)]. Now (*) shows that CCp[f,(T)] and thus, 


by 2.26, uniformly for Hence 
1 1 


which completes the proof of the theorem. 


2.30. THEorEM. Let pCo(f(T)) where fE F(T) and suppose that f(A) Fu on 
any of the domains in which it is regular. Let i, - - - , Xx be those points d in 
o(T) where f(\) =p. Then 

(1) If wis a pole of Re(f(T)) then du, - - - , Xx are poles of R;(T). 

(2) Conversely if Xi, - - - , Ax are poles of R;(T) then pis a pole of R:(f(T)). 

Since f(A) is not identically 4 on any of the domains in which it is regular 
it is clear that there are only a finite number Ai, - - - , A, of roots of f(A) —u=0 
which lie in the spectrum o(7). Since f(¢(T)) =¢(f(T)) and wEo(f(T)) there 
is at least one root of f(A) —y=0 in o(7). Let mi, - - - , m, be the multiplici- 
ties of Ay, - - - , Ay as roots of f(A) —y=0 and let AG F(T) be defined on the 
same domain as f by the equation 


w— = Ar — — 
so that (A) has no roots in (7) and thus h~'(T) exists. To prove (1) we have, 
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by 2.24, 7=(Ai1, - - - , Ax) a spectral set of T and 
(*) E,[f(T)] = Ey[T] + --- + 4, [7]. 
Now for any integer we have 
[ul — f(T) )"Z,[f(7)] 

= (A — T)™"-- (Ae — T)™*h"(T)(Ey[T] + --- + B,[T)). 
If n—1 is the order of uw as a pole of R;(f(T)) we have, by 2.15, the left side 
of (**) vanishing, and upon operating with E,,[T] we get 

(Au — T)™"- — T)™"h(T)E,,[T] = 0. 


Since A4~'(T) exists and since, by 2.13, (AjJ—T) for 72 is a 1-1 map of %,, 
into all of itself we have 


(**) 


(AZ — T)™*E,,[T] = 0, 


which, since , is isolated in (7), proves (by 2.15) that A; is a pole of R,(7). 
Now conversely suppose that \;,7=1, - - - , k, is a pole of order »; for R;(T) 
and let # be an integer with nm2v;,i=1, -- -, k. Since f(A) is continuous on 
a(T) it is seen that yp is isolated in o[f(7)] and thus (*) and hence (**) are 
valid. Since, by 2.15, (A;J—7)"**"E,,[7] =0 this same theorem shows in view 
of (**) that yu is a pole of R,(f(7)). 


2.31. DEFINITION. Let S be the unit sphere in %. An operator U in & is 
said to be compact in case US is compact in ¥ and it is said to be weakly compact 
in case US is weakly (sequentially) compact in %. 


We prefer the terminology of 2.31 to the usual terms, that is, completely 
continuous and weakly completely continuous. 


2.32. THEoreM. Let fEF(T) be such that f(T) is compact. Then every 
AEo(T) with is a pole of and the corresponding manifold ¥,[T] 
has finite dimension. 

By a well known result of F. Riesz(*) u=f(A) is isolated in the spectrum 


o(U) of the compact operator U=f(T). Thus if c=o(7T)f-'(u) we have, by 
2.24, o a spectral set of T and 


%,[U] %.[T]. 


Now by 2.13 we see that U is a 1-1 map of %,[U] into all of itself and hence the 
unit sphere in %,[U] is compact and %,[U], therefore, has finite dimension. 
Thus the spectrum of T where it is considered as operating in ¥,[7] is finite 
and by 2.12, therefore, ¢ is finite, which shows that A is a spectral set of 7. 
Hence %,[T7]C%,[T] has finite dimension and, in ¥,[T], AJ—T) has the 


(*) F. Riesz, Uber lineare Funktionalgleichungen, Acta Math. vol. 41 (1918) pp. 71-98. 
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single point 0 for its spectrum. Thus in ¥, [7] the matrix (AJ—T) is nilpotent, 
that is, for some m, (AJ —7T)"E,[T] =0 which proves that A is a pole of R;[T]. 


2.33. THEOREM. Let f,G F(T) and let U=lim, f(T) be a compact operator. 
Then every \Ga(T) for which f,(d) does not approach 0 is a pole of R;(T) and 
the corresponding manifold X,|T | has finite dimension. 


Let f(A) be the spectral function of U so that f(A) =lim, f,(A) for every 
Thus if u=f(A) we have, by 2.28, and by Riesz’s result 
» is isolated in o(U). The proof now proceeds as it did in the preceding theo- 
rem except that we use 2.29 instead of 2.24. 


2.34. THEOREM. If 7” is compact then every spectral point 10 of T is a 
pole of R:(T). 


This is an immediate corollary of 2.32. 


2.35. The Fredholm theory. If U=f(T) is compact, where f(A) is regular 
in the circle |A] S| 7| and has no roots other than \=0 in this circle, then 
2.32 shows that all points 10 in o(7) are isolated in o(T) and are poles of 
R,(T). In view of 2.17 it is seen that all of the principal results of the Fred- 
holm theory are valid for the operator 7. In particular if T is a weakly com- 
pact operator in a Lebesgue space L it is known (Dunford and Pettis(!°) 
and Phillips('")) that 7? is compact and so the Fredholm theory is valid for 
weakly compact operators in L. 

3. Convergence to projections. 


3.1. THEOREM. Every projection EER(T) is in R(T) and E=E,[T], where 
o is a spectral set of T and consists of all \Ga(T) where the spectral function 
f(A) of E has the value 1. Thus E=0 if and only if there arc. no such X. 


The proof will require the following lemma. 


3.2. LEMMA. Every \ 0, 1 is in the resolvent set p(E) of a continuous projec- 
tion E and 


R(E) = E/E-1)+ #0,1. 


Thus E,{E]=E, Eo[E]=I—E, o(E) =0 if and only if E=0; o(E)=1 if and 
only if E=I, and o(£) =¢,(E£). 


The formula for R;(Z) is verified by multiplying by £J—E and the re- 
maining conclusions follow immediately. 


(**) N. Dunford and B. J. Pettis, Linear operations on summable functions, Trans. Aimer. 


Math. Soc. vol. 47 (1940) pp. 323-392. 
(4) R. S. Phillips, On linear transformations, Trans. Amer. Math. Soc. vol. 48 (940) pp. 


516-541. 


| 
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Theorem 3.1 now follows from 2.29. If we take o of that theorem to be 
the point 1 then E=E,{E]=E,[T] where o consists of all \Go(T) with 


=1. 


3.3. Lemma. If f,G F(T) and tf d is a pole of order v for R;(T) then the 
. weak convergence of f(T) implies the existence of the limits 


lim (a), 


Furthermore if f,(T)—>f(T) weakly where fEF(T) then 


By 2.17, A has index v and so the lemma follows from-2.21 by the same 
argument used to derive 1.11 and 1.12 from the formula in 1.8. 


3.4. THEOREM. Let P be a polynomial not identically zero and let f,EG F(T), 
n=1,2,---.Iffa(T) converges to a projection E with EXCM|P] then either 
there are no \Ca(T) where f,(A)—1, in which case E=0, or else: 

(1) The set o of points \Ga(T) where f,(A)—1 consists of a finite number 
of poles of R(T). 

(2) If 1, +++, ve are the orders of the poles di, - - - , Xx then 


(3) Fori=1,---,k, sts a root of P(A) and its multiplicity is at least v;. 

(4) 

If there are no A\Go(T) where f,(A)—>1 then E=0 by 3.1. Now suppose 
the set o of all A\Go(T) where f,(A)—>1 is not empty. We have, by 3.1, ¢ a 
spectral set of T and E=E,[T]. Since EXCM[P] we have P(T)E.(T)=0 
and so (1), (3) and (4) follow from 2.19. Statement (2) follows from the pre- 
ceding lemma. 

3.5. Lemna. Let n=1, 2,---, and let PQ) be 
a polynomial whose distinct roots are dx, - - - , Ax. Then if f,.(T)P(T)—0 weakly 
and lim, =a;#0,i=1, +--+, k, we have 

If A\;s€ p(T) the conclusion is obvious. If A;Go(T) then f,(A) is regular 
at A; and we may expand P(A)f,(A) about A=A;, 

(A (m) 


— ,)™ 
P(A) fa(d) = [ P(A) -fa(d) + (A — 4) 


m=0 


£n,i(A). 


If the function g,,; which is regular in a neighborhood of }, is defined by the 


if 
a 
4 
4 
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above formula on the whole of the 7-admissible domain D, upon which f, 
is defined and regular, then g,,,€G7(7T). Now since P“(A;)=0, r<v;, and 
#0 we have 
(m—r) (r) 
m mvrin (Ax) = 0, m < Vi, 

m = Vi, 

and so 

(*) P(T) = ————— fui) PO (i) 


Vy. 
Since P(T)f,(T)—0 weakly, this equation shows that for xC Dit’ we have 


and since (\;) #0 we have xE Mii. Thus = = = M},, 2 2v;. Since 
P(T)f,(T)-—0 weakly, (*) gives 


and since a;P’?(\;) #0 we see that every xC Vi is the weak limit of a se- 
quence Thus Wi and hence 


Now since (A;J —T)P(7)f,(T)-—+0 weakly we conclude, by what has just been 
proved, that = A repetition of this argument shows that Nz, = Ws 
for all 


3.6. Let be a@ polynomial whose distinct 
roots are \y, ++, Let F(T) satisfy 

(1) A) 0, i=1, »k3j7=1, | 

(2) P(T)f.(T)—0. 
Then the following statements are equivalent. 

(3) E, E*=E, EX=M[P]. 

(4) Each \;,i=1, +--+, k, is either in p(T) or else a pole of R;(T). 

(5) Fori=1,---,k, X4 is either in p(T) or else a pole of R;(T) of order 
at most v;. 

(6) is closed. 

(7) Nit’, ¢=1,-+-, k, és closed. 
Furthermore when ‘the limit E exists the set Ax)o(T) is a spectral 
set of T and 


E=E£,(T], M[P]=%, 


In view of 3.4 statement (3) implies (4) and (4) in view of 3.5 and 2.17 
implies (5). From 2.23 it is seen that (5) and (6) are equivalent. Also 2.17 
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shows that (5) implies (7). Now assume (7). Lemma 3.5 gives 
= mid wi", 
so that for n2v;. Since = Ni, we have for every xCX a Nii 
such that —T)”z, that is and so 
(*) ¥ = Mai + Mi and Nii is closed. 
Now if A;€o(T), f, is regular at A; and so for \ near \; 
fn 
fo) = 
imo)! 
If gn,(A) is defined by the above equation for all \ in the domain D, where 
f, is defined and regular then g,,,E7(7) and 


(A — Aa)? (A — Aa) (A). 


vi-1 x. 
= 
J 


Thus for xE Ms we have, using (1), 


f(T)x = >> fn (T — x. 


j=0 
Now if P;(A) =P(A)/(A—A,)”* we see from Lemma 1.4 that P,;(7)x #0 provid- 
ing O¥xE Mi. Thus if we have f,(T)x—x, Pi(T)f,(T)x 
= P(T)f,(T)y—0 (by (2)) and so P;(T)x=0 and hence x =0. This shows that 
- = 0. Thus (*) gives 


= MKS @ Ni, Nii is closed. 


Thus 2.23 applied to the function f(A) = (A;—A)”* shows us that A; is a pole 
of order at most v; for R;(7). Hence (7) implies (5). We have now shown the 
equivalence of (4), (5), (6), (7) and also the fact that (3) implies any one of 
them. We shall now show that these conditions imply (3). Assuming (5) we 


have by 2.21 
mi—1 (T 
f(T) = + fr OD EnITI, 
jul j! 

where (Ai, , Ax)o(T) and m; is the order of as a pole of R;(T). 
Now in %,- the spectrum of T is o’ (2.12) and hence since 0€& P(e’) we see 
that P(7) is a 1-1 bicontinuous map of %,- into all of itself. ‘Thus f,( EZ. 
=f,(T)P(T)P-'(T)E.—0 (by (2)). The above formula for f,(7) now shows 
(using (1)) that f.(7)—)0a,e-Ex,[T]=E.[T]. The final statements of the 
theorem follow from 2.23. 
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3.7. THEOREM. Let P(A), fn(A) be as in the preceding theorem and satisfy 
(1) and (2) of that theorem. Fori=1,---,k let (AxIJ—T) take bounded closed 
sets into closed sets. Then statements (3), - - - , (7) of the preceding theorem are 
all true. 


Since (A,J — T) takes bounded closed sets into closed sets it may be proved 
(see Banach(!*) where the corresponding statement is proved for a compact 7) 
that (A.J —T7)z is closed. Since a repetition of this 
principle gives the fact that (A,iJ—7)*X is closed, and so on, that is, Ni, is 
closed for all m=1, 2, - - - , and thus (7) of 3.6 is satisfied. 


3.8. THEOREM. Let P(A), f(A) be as in 3.6 and satisfy (1), (2) of that theo- 
rem. Suppose that 0 is not a root of P(A) and that T" is compact for some integer n. 
Then (3), «+ +, (7) of 3.6 are all true. 


By 2.34 all spectral points \+0 of T are poles of R;(7) and so 3.8 follows 
from 3.6. Theorem 3.8 is also an immediate corollary of 3.7. 


3.9. Tuzorem. Let be polynomial whose distinct 
roots are \y, +--+, Ax. Let frac F(T) satisfy the following: 

(1) A) 0, i=1,-- +, f=1, +++, 

(2) P(T)f.(T)-0 strongly. 
Then the following conditions are equivalent. 

(3) strongly, E?=E, EX=M[P]. . 

(4) f.(T)x is weakly compact, x CX. 

(5) R[P]om[P]=x, |7.(7)| <M. 


As in the proof of 3.6 we have for yE Mi 

and so 1.4 gives the fact that f,(T)y—y for yEM[P]. Also (2) shows that 
f.(T)y—0 for yEN[P]. Now if we assume (4) we see that |f,(7)| <M and 
hence f,(T)y—0 for yEN[P]. Thus R[P]-M[P] =O. Since |f,.(T)| SM it is 
seen that f,(7)x converges for every x in the closure of N[P]+M[P]. It will 
now be shown that 
(*) R[P] = x. 
If this is not true there is an x€X and an x*CX* with x*x=1, x*M[P]=0 
=x*M[P]. By 1.4 there are points x;EM[P;] with x=x,4+ - - - +x, and, 
since x*x=1, not all x*x; are zero. Suppose x*x;~0 and pick a sequence n; 
of integers so that f,,(T)x:7y1 weakly. Since x1GN[Pi] it is of the form 
and (iJ Thus Ad 


(2) S. Banach, Théorie des opérations linéaires, Warsaw, 1932; in particular Théoreme 11, 
p. 151. 
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—0 which that that is, Since 
x*I[P]=0 we have therefore 


x*f,(T) x1 — 0. 


Now it is clear that it is no loss of generality to assume that A1€o(7), for if 
\i1€ p(T) hypothesis (2) remains valid if P(7) is replaced by Pi(T) and also 
mM [P]=M[P,.]. Thus we shall assume \i1Go(7) and hence f,(A) is regular at 
A=A,. As in the proof of 3.5 we have 


@ 
f(T) = + (T — 
j=0 
and hence, since (T—\iJ)"'x1=(—1)"P(T)a and x*[P]=0, we have from 
(1) 
(i) 
7=0 

The fact. contradicts (**) and proves (*). Thus on the basis of (4) we have 
shown that 


(T ALD) x* ~ 0. 


= lim f,(T)x, x &, 


exists. Statement (2) shows that P(T)E=0 and hence EXCM[P]. But, as 
was shown above, f,(7T)x—x for xEM[P] and hence E*x =lim, f,(T)Ex =Ex 
so that E*=E, EX=M[P]. Thus we have shown that (4) implies (3) and (5). 
It remains to be shown that (5) implies (3). But, as was shown at the begin- 
ning of the proof, (1), (2) and \fn(T)| <M imply the convergence of f,,(T)x 
for every xER[P]+M[P] and hence Ex=lim, f,(T)x exists for every x CX. 
The above argument shows that E£ is a projection of the whole space onto the 
manifold MP]. This completes the proof of the theorem. 

As Theorem 3.4 shows, the condition (1) of 3.6, that is, f,(A;)->1, 
fPA)—-0, 157 Sv;—1, is a necessary condition for the uniform convergence 
f.A(T)-E, E?=E, EX=M[P] providing P(A) is the polynomial of smallest 
degree for which P(T)f,(7)—0. As the following theorem shows, a completely 
analogous situation holds in the case of strong convergence. 


3.10. THEorEM. Let P(A) be a polynomial and f,€G F(T) be such that 

(1) fa(T)—E strongly, E3=E, EX=M[P]. 
Let P'(d) be the polynomial of smallest degree such that I[P]=M[P’]. Let 
P''(A) be the polynomial of smallest degree such that P’'(T)f,(T)—30 strongly. 
Then 

(2) P’(A) =P’’(). 

(3) IfX4is a root of P’(d) of multiplicity v then d is in the point spectrum of T 
and has index v and 
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Clearly 0Sv{ Sv; and since (1) implies P(7)f,(T)->0 strongly we see that 
0sv/' S»,. Also since, in the sense of strong convergence, we have P’(T)f,,(T) 
—P'(T)E=0 we see that 0S»/’ Sv/ Sv;. Let yw; be the index of d;. By 1.4 
we have - - - GPM, from which it is clear that 
every root A; of P’(A) has index u;2v/ and hence every root of P’(A) is in 
the point spectrum. Since f,, is regular at such a root we may write 

(7) 
n= 
j=0 
If and (AJ —T)x=0 we have f,(T)x =f,(Ax)x and since 
we have f,(T)x—«x, that is f,(A;)—1. If v{/ >1 then there is an x with 
(T—d,J)x #0, (T =0 so that xEM[P] and 
(1) 


x = + fn (i)(T — AD) x, 
which shows that f<”(A;) 0. This argument may be repeated to give 


(T — + (T — gns(T). 


j! 


fa) 0, j 1, 1. 


Since 0Sv{’ Sv! we see therefore that (1), (2), and (4) of Theorem 3.9 are 
satisfied with P\A) replaced by P’’(A) and hence, by that theorem, f,(7) con- 
verges strongly to a projection of ¥ on M[P’’]. This shows that M[P’’] 


=M|P], which proves that v/ Sv/’ and hence P’(A)=P’’(A). It remains to 
be shown that the index y; of a root A; of P’ is equal to its multiplicity v/. 
We have already seen that w;=v/, and 3.5 applied to the polynomial 
P''(A) =P’(A) gives u;Sv{. This completes the proof of the theorem. 

The discussion of weak convergence is almost identical with strong con- 
vergence, and so we shall not discuss it in full but merely state the following 
two theorems. These theorems may be proved by using the methods of proof 
employed ia 3.9 and 3.10. 


3.11. TuEorem. Let be polynomial whose distinct 
roots are +++, Ax. Let F(T) satisfy 

(1) fas)—1, 0, +++, 

(2) P(T)f.(T)-0 weakly. 
Then the following assertions are equivalent. 

(3) f(T) weakly, E?=E, EX=M[P]. 

(4) f.(T)x is weakly compact, x CX. 

(5) R[P]eom[P] =x, |f.(7)| <M. 

3.12. THEOREM. Let P(A) be a polynomial and f,G F(T) be such that 

(1) f.(T)E weakly, E2=E, EX=M[P]. 
Let P’(d) be the polynomial of smallest degree such that M[P]=M[P’]. Let 
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P''(A) be the polynomial of smallest degree such that P’'(T)f,(T)-0 weakly. 
Then 

(2) P’A)=P"’(). 

(3) IfX4is a root of P’(d) of multiplicity v then d is in the point spectrum of T 
and has index v and 

3.13. Let be polynomial whose distinct 
roots are Ax. Let F(T) be such that 

(2) fa(T)E weakly, E2=E, EX=M[P]. 
Then 

(3) strongly if and only if P(T)f,(T)—>0 strongly. 

(4) fa(T)E uniformly if and only if P(T)f.(T)—0 uniformly and N[P] 
ts closed. 


This is an immediate consequence of Theorems 3.6, 3.9 and 3.11. 

In the following discussion of almost everywhere convergence we suppose 
that ¥ is composed of measurable scalar functions x(¢) defined for ¢ in a set S. 
The functions x©% are measurable with respect to a completely additive 
non-negative set function ® which is defined on a Borel field containing S. 
We suppose also that S is a denumerable union of sets of finite measure and 
that x =0 if and only if x(¢) = 0 almost everywhere. Addition and scalar multi- 


plication of the functions x(f) correspond to the similar operations in %. It 
is also assumed that if x,—x in ¥ and x,(t)—y(¢) almost everywhere then 
x(t)=y(t) almost everywhere. For a linear operator U in ¥ we shall write 
U(x, t) for the value of the function Ux at the point ¢. 

The following theorem of Banach(!*) is fundamental. 


3.14. THEOREM. Let U,, be a sequence of linear operators in & satisfying 
(1) For every xCX, lim supa | U,(x, t)| < «© almost everywhere, 
(2) For every x in a fundamental set in X the lim U,,(x, t) exists almost every- 
where. 
Then 
(3) For every xCX the lim, U,(x, t) exists almost everywhere. 


3.15. THEOREM. Let P, f,, E be as in 3.9 and satisfy (1), (2), (4) of that 
theorem. Then 

(1) for every lim, f,(T)(x, t) = E(x, t) almost everywhere 
if and only if : 


(#) S. Banach, Sur la convergence presque partout des fonctionelles linéaires, Bull. Sci. Math, 
(2) vol. 50 (1926) pp. 36-43. This theorem of Banach has also been used by Yosida in generaliz- 
ing the ergodic theorem of G. D. Birkhoff, see K. Yosida, Ergodic theorems of the Birkhoff- 
Khintchine's type, Jap. J. Math. vol. 17 (1940) pp. 31-36. 
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(2) for every xEX, lim sups |fn(T)(x, t)| < © almost everywhere, 


and 
(3) for every xEX, lim, P(T)f,(T)(x, t) exists almost everywhere. 


Statements (2) and (3) are clearly necessary for (1). Now conversely (3) 
shows that f,(7)(x, ¢) converges almost everywhere for xEN[P], and for 
xE Mi we have 

= : (T — ix, 
and so f,(7)(x, t) converges almost everywhere for xEM[P] Thus 
3.9(5) and 3.14 give the desired conclusion. 


THE ERGODIC THEOREM 


The following theorems are obtained from the preceding ones by taking 
the polynomial P(T) to be J—T, that is, we are here concerned with conver- 
gence to a projection of the whole space on the fixed points of 7. For brevity 
we Shall write M, N in place of M[P], N[P] so that M is the manifold of 
fixed points and !}=(IJ—T)X. While the ergodic theorem is concerned with 
the convergence of f,(7) where f,(A) =n-!)_"-})’ we shall word the theorems 
for any sequence of functions f,€7(7) with f,(1)—1. It should be noted that 
the condition (I—T7)f,(7-)->0 in any of the various meanings of convergence 
reduces, for the particular sequence of averages mentioned above, to the con- 
dition 7"/n—0. 


3.16. THEOREM. Let f,G F(T) satisfy f,(1) and (I—T)f,(T)-0. Then 
the following statements are equivalent. 

(1) f.(T) E*=E, EX=M. 

(2) The point \=1 is either in p(T) or else a pole of R;(T). 

(3) The point \=1 is either in p(T) or else a simple pole of R;(T). 

(4) N@M=X, N is closed. 

(5) is closed. 


This("*) is a corollary of 3.6. 


3.17. THeoreM. Let F(T) satisfy f,(1) 1, (I—T)f.(T)-0, and let 
I—T take bounded closed sets into closed sets. Then statements (1), +++, (5) 
of 3.16 are all true. 


This is a corollary of 3.7. 


3.18. THEOREM. Let f,EG F(T) satisfy f,(1)—1, and let 


(4) In the original statement of this theorem the assertion was made that if A\=1 is iso- 
lated in o(T) and if | T| <M then A=1 is necessarily a simple pole of R(T). We are indebted 
to Professor Einar Hille for detecting an error in the attempted proof of this statement. 
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be compact for some integer n. Then the statements (1), - - - , (5) of 3.16 are all 
true. 


This is a corollary of 3.8. 


3.19. THEOREM. Let F(T) satisfy f,(1)—1 and (I—T)f,,(T)-30 strongly. 
Then the following conditions are equivalent. 

(1) f.(T)-E strongly, E?=E, EX=M. 

(2) f.(T)x ts weekly compact, x CX. 

(3) R@M=Z, |f.(T)| <M. 


This follows from 3.9. 

Similar theorems concerning weak convergence and almost everywhere 
convergence may be obtained from 3.10, - - - , 3.15, and these are left to the 
reader. 

It should be mentioned however in connection with the ergodic theorem 
of G. D. Birkhoff(#*), which is concerned with the almost everywhere conver- 
gence of n—)-*=1x(o"t), that the condition (3) of 3.15 is redundant. That is, 
even with no assumption concerning the measure preserving character of the 
point map ¢ other than the assertion that the strong (that is mean) theorem 
holds, we can say that n-) *-1x(o"t) converges almost everywhere if and 
only if 


n—1 


1 
(*) lim sup} — >> | < almost everywhere. 


To see this note that (*) implies 
(**) lim sup | /n| < almost everywhere. 


Clearly x(@"t)/n—>0 for a bounded function x(¢) and hence 3.14 shows 
that x(@"(t))/n—0 almost everywhere for every summable function x(t). Thus 
3.15 shows that (*) is necessary and sufficient for 

1 n—1 
(***) lim — >> x(¢"t) = E(x, #) almost everywhere. 
n ved 

Hence, one important problem remaining is to determine the conditions 
on the point map ¢ which are necessary and sufficient for (*) and hence for 
(***). This problem will be discussed elsewhere. 


(*) G. D. Birkhoff, Proof of the ergod.c theorem, Proc. Nat. Acad. Sci. U. S. A. vol. 17 
(1931) pp. 656-660. : 
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SETS OF UNIQUENESS AND SETS OF MULTIPLICITY 


BY 
R. SALEM 


1.1. Introduction. Let E be a set of points in the interval (0, 27). E is 
said to be a set of multiplicity for trigonometrical series if there exists a 
trigonometrical series not vanishing identically and converging to zero out- 
side E. If no such series exists, E is said to be a set of uniqueness. Every set 
of positive measure is a set of multiplicity, but there exist sets of multiplicity 
of measure zero. On the other hand, among perfect sets of measure zero there 
are sets of uniqueness as well as sets of multiplicity. 

Let P be a perfect set of measure zero in (0, 27) and let F(x) be any con- 
tinuous, non-decreasing, purely singular function constructed on P, that is to 
say F(x) is constant in every interval contiguous to P, but not everywhere. 

Then it is a consequence of Riemann’s classical theory of trigonometrical 
series that if the Fourier-Stieltjes coefficients of dF tend to zero, that is, if 


on = [evar = o(1) (n— 


P is a set of multiplicity. But if c,0(1) for every F(x) of the above described 
type constructed on P, it is not known whether P is necessarily a set of 
uniqueness. 

The only sets of uniqueness constructed so far are either sets of the 
“Hardy-Littlewood-Steinhaus type,” called sets of the type H, or sets which 
are the sum of a finite number or of a denumerable infinity of H-sets, called 
sets of the type H,. For these results, due to Rajchman and to Nina Bary, 
pro for the general theory we refer the reader to Zygmund [1, pp. 291- 
296 |(*). 

We shall suppose henceforth that P is a perfect set of the Cantor type 
and of constant ratio of dissection § (0<£<1/2), constructed on (0, 27). Such 
a set is obtained by dividing the fundamental interval in three parts of lengths 
proportional to , 1—2, and &, respectively, and by removing the central 
open interval. Two intervals are left on which we perform the same opera- 
tion, and so on indefinitely. (Cantor’s classical ternary set corresponds to 


§=1/3.) 
It is easy to see that any point x belonging to P is given by the formula 


where the ¢; are 0 or 1. Let F(x) be the continuous, non-decreasing, purely 


Presented to the Society, February 27, 1943; received by the editors February 2, 1943. 
(?) References to bibliography at end of this paper will appear in brackets. 
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singular function defined by F(x) = €1/2+-€:/2?+ --- +6¢,/2'+ whenx 
is given by (1.1.1). Then we have [Carleman 1, p. 225; Salem 1, p. 325]: 


2r 
(1.1.2) = f = cos rnt*-1(1 — 
0 k=l 


Instead of considering the Fourier-Stieltjes coefficients of dF (depending 
on the integral parameter 2) we can consider the Fourier-Stieltjes transform 
of dF (depending on the continuous parameter v), F being defined to be equal 
to zero in (— ©, 0) and to 1 in (27, ~). We have thus 


+00 
(1.1.3) c(v) = f = cos rvt*1(1 — £). 
k=l 

If the set P and the function F are constructed on the interval (—7/(1—&), 
a/(1—£)) instead of the interval (0, 27), we have, denoting by ® the function 


F((1 —t)x+n), 
+00 

(1.1.4) y(u) = f e“izd@ = |] cos rué*. 
aids bo 


We can interpret (x) as the distribution function which is the convolution 
of infinitely many symmetric Bernoulli distribution functions: 


®(x) = B(x/x) + B(x/wt) + --- + 


B(x) being the Bernoulli distribution function equal to zero in (— ©, —1), 
to 1/2 in (—1, +1) and to 1 in (1, + ©). With this interpretation, it is known 
(see Jessen and Wintner [1]) that & need no longer satisfy the condition 
0<£&<1/2, but must satisfy the inequality 0 <£<1:-(z) is still the Fourier- 
Stieltjes transform of the distribution function ®, the essential difference be- 
ing that if 0<&<1/2, the “spectrum” of @ is the perfect set nowhere dense P, 
and @ is purely singular; while if 1/2 <1 the “spectrum” of ® is the whole 
interval (—72/(1—&), #/(1—&)), and © can be either purely singular or abso- 
lutely continuous. 

In what follows, c,, c(v) and (u) shall always have the meaning of the 
formulae (1.1.2), (1.1.3) and (1.1.4), respectively. 

While, if we are concerned only with the behaviour of c, or of y(u) as n 
or u tends to ~, we can suppose 0 <é <1, it is plain that in the study of perfect 
sets of uniqueness and of multiplicity we must suppose 0 << 1/2. 

1.2. It has been known for a long time that the question whether a set P 
of the above-described type is a set of uniqueness or a set of multiplicity is 
deeply related to the arithmetical properties of the number & (see Nina Bary 
[2]). Actually it has been shown by Nina Bary [3] and by Kershner [1] 
that if is rational and equal to the irreducible fraction p/g, c, and y(u) tend 
to zero if #1, and do not tend to zero if p=1. This, as we have seen, implies 
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that P is a set of mnitiplicity when +1, but not necessarily that P is a set 
of uniqueness when p=1. This latter result is, however, true, and has been 
proved by Nina Bary [3] who has shown that if p=1 the set P is of the type 
H, thus solving completely the question whether P is a set of uniqueness or a 
set of multiplicity when & is rational. 

The purpose of the present paper is to give the solution of the same prob- 
lem when £ is irrational. This solution has been rendered possible due to an 
interesting work of Pisot concerning the properties of a certain set of alge- 
braic integers which we shall define presently (see Pisot [1]—the same set 
has also been considered, later on, by Vijayaraghavan [1]). We shall say that 
a is a “Pisot-Vijayaraghavan number,” or briefly a “P. V. number,” if a@ is 
an algebraic integer such that all its conjugates lie inside the unit circle 
|z| <1. The relation existing between P. V. numbers and the problem of the 
behaviour of y(u) as u—>© has first been recognized by Erdés [1], who 
proved that if 1/£ is a P. V. number, y(u) does not tend to zero for u—- ~. 
(The trivial case =1/2 is excluded.) 

Here we shall prove the following theorems: 


THEOREM I. Let G(x) be any non-decreasing function, constant in (— , 0) 
and in (27, ~), and let d, and d(v) be the Fourier-Stieltjes coefficient and the 
Fourier-Stieltjes transform defined by 


+00 
d, = f d(v) = f e**=dG. 
0 


Then, if d,=0(1) for n—~«, we have also d(v) =0(1) for v>«. (The converse 
proposition is, of course, trivial.) 

This theorem was proved first independently by A. Zygmund, whose 
proof, based on different ideas, will be published elsewhere. 

In particular we have: Let 0<£<1. If c,=0(1) for n= @, then c(v) =o0(1) 
for v= «, and hence y(u) =0(1) for u= «©. In other words, y(u) and c, both 
tend or do not tend to zero as u and m increase infinitely. 


THEOREM II. Let 0<&<1. If y(u) does not tend to zero for u= ~, then 1/é 
is necessarily a P. V. number. 

This theorem is the converse of the result of Erdés quoted above. Collect- 
ing the two results we see that c, and y(u) tend to zero if, and only if, 1/& 


is not a P. V. number. 
This proves that if 0<<1/2 and 1/é is not a P. V. number, the set P 


is a set of multiplicity. But it does not prove that if 1/& is a P. V. number, 
0<£<1/2, the set Pis a set of uniqueness. This result is the object of Theo- 


rem ITI. 
THEOREM III. Let 0<&<1/2. The set P is a set of uniqueness if (and only if) 


= 
= 


1943] SETS OF UNIQUENESS AND SETS OF MULTIPLICITY 


1/Eisa P. V. number. In this case P is a set of the type He. 


When £<1/2 approaches 1/2 the middle interval removed in the con- 
struction of the set P has a relative length tending to zero, and the Hausdorff 
dimensionality of the set, which is log 2/ | log é| , tends to 1. It is, therefore, of 
some interest to prove that we can have perfect sets of uniqueness con- 
structed with §=1/(2+e), € being positive and arbitrarily small. This is the 
consequence of the following theorem: 


THEOREM IV. There are P. V. numbers of the form 2+-¢, € being positive 
and arbitrarily small. Hence, there are perfect sets P of uniqueness for which the 
Hausdorff dimensionality is as close to 1 as we please. 

2. Proof of Theorem I. We have d(v) = /*Se**dG = {-"e"*dG and d, =d(n). 
Suppose that lim d(m)=0. If d(v) does not tend to zero, we can find a se- 
quence of numbers 2, v2, +++, Up, increasing infinitely and such that 
| d(v,p)| >6, 5 being a positive number. Let v,=",+a,, m, being the in- 
tegral part, and a, the fractional part of v,. There exists an infinite subse- 
quence {v,} of the sequence {v,} such that a, has a limit a when go. We 
have 


d(v,) = f 
0 


hence 


f enaizerizdG — d(v,) 
0 


< [G(2r) — G(0)] max | — eriz| 


€, tending to zero as g ©. Hence, for g large enough we would have 


2r 
f 
0 


But this is impossible for it is known as a consequence of Rajchman’s theory 
of formal multiplication of trigonometrical series that /?*e"* cos axdG and 
Je"e"** sin axdG tend to zero for n= @ if, as is the case, />"e"*dG tends to 
zero. (See Rajchman [2, p. 688].) Thus the theorem is proved. 

3.1. Proof of Theorem II. Suppose that 0<&<1 and that y(u) 
=|]. cos rué* does not tend to zero for 1= ©. This means that there exists 
a sequence of numbers 1%, U2, - - * increasing infinitely and such that 
|-y(u.)| >a, a being a positive number. Let 6=1/E (1<0@<@) and let 
6" <u, 56", the integer »=n, being a function of s. We can write u,=i,6" 
where 1/0<X, $1. Hence X, belongs to the interval (1/6, 1) and we can find an 
infinite subsequence {u,} of the sequence {u,} such that A, has a limit » 
(1/8@S 31) when go. Now 


| | = | | S| cos wA,-cos #A,9-cos - - cos | 


> 8/2. 


| 

a 

z 
‘ 

i 

| 

i 

i | 

if 
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hence 
(1 — sin? rd,)(1 — sin? - (1 — sin? = a? 
which implies 
exp (— sin? rA, — sin? — --- — sin? rA,0%) a?, 
that is to say 
sin? rd, + sin? + + log 1/a?. 


Let now u, be a number belonging to the sequence {ue} and let r>gq. 
We shall have 


sin? rd, + sin? + + sin? S log 1/a? 
and, as r>q, we shall have, a fortiori, 
sin? rd, + sin? + -- + + sin? S log 1/a?. 
Let now r— ©, g being fixed. We shall have, as lim A,=A, 
sin? rd + sin? + - - - + sin? log 1/a*. 


Since gq is arbitrary, this implies that the series 


(3.1.1) sin? 


converges to a sum not greater than log 1/a*. Hence: 

Let 6=1/. If y(u) does not tend to zero for u= ~, there exists a number X 
(1/6SX S1) such that the series converges. 

Once this result is proved, Theorem II is a direct consequence of the the- 
ory of Pisot. Since, however, this theory is unnecessarily general for our par- 
ticular purpose, we think it better to give here a brief account of Pisot’s proof 
for the convenience of the reader. 

3.2. Suppose that the series (3.1.1) converges. Let w,=d6* and let 
Wn =G,+p, where a, is the integer nearest to w, and —1/2<y,31/2. Then 
the sequence {aq} is a recurring sequence, in other words 


Onzk + + + dpa, = O, n= 1,2,3,---, 


where & is a fixed integer and y, be, - - - , bg are fixed rational numbers. Itis 
known that in order to prove this it is sufficient to prove that the determinant 


a a 
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is equal to zero for all nm =k. k being the smallest integer having this property, 
k is the minimum order of the recurrence. Here we can write 


— Ody = — — (ups — Oup) = — = 


hence 


a2 — 0a; — — Ody 


Gn — Onze — Gon — 


In 


m1 


Ne 


an * Nan—1 


Nn 


Nn+1 


and, by Hadamard’s theorem, 


2 n n 2 n+1 2 2n—-1 2 
Ds La) > 1). 
0 0 1 n—1 
Now the convergence of (3.1.1) is equivalent to the convergence of >> “ui. 


Let >oi'u3 = Ry. We have 
(0° + 1)(uy + 


nth 


SO + Res) S 206° + 
h h 


hence 


( > ay + 1)"RoRi- 
0 


Now, as p> ©, dg~w,=AO?, hence 


+0") 
= 0" — 1)/@ — 1) < C8", 


C being a constant depending on 6 and A only. Hence 


C[20°(0° + 1)]"RoRi- 


as R,—0 for n—«, we have D,-—>0 for n— ©, and as D, is an integer there 
exists an integer k such that D,_1+0 and D,=0 for n2k; and the sequence 
{a,} is a recurring sequence of minimum order k. 


if 
> 
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This, as is well known, proves that the power series }“a,z" represents a 
rational fraction whose denominator is 1+),;z+ -- - +d,2*. Thus 


P(z) 
= 


P(z) being a polynomial with rational coefficients. Now it follows from a 
theorem of Fatou [1, pp. 368-370, also Pélya and Szegé, 1, Aufgabe 2, pp. 142 
and 357] that if a power series with integral coefficients represents a rational 
fraction, this fraction can be written in the form P(z)/Q(z), P and Q being 
polynomials with integral coefficients, and Q(0) being equal to 1. In other 
words, in (3.2.1), & being the minimum order of the recurrence, bi, be, - - - , dy 
are integers. 
Now 


2 1 P(z) 
= — = X . 


0 0 


Since converges, the radius of convergence of is at least 1, 
Hence 1/@ must be a root of 1+),2+ - - - +d,2*=0, and the k—1 other roots 
must have moduli not less than 1. But owing to.the convergence of >> *u2, 
the function }>*u,2" cannot have a pole on the circle |z| =1. Hence the con- 
jugates of 1/@ have their.moduli greater than 1. 

It is now sufficient to consider the equation 2*+,2*-!+ - - - +d,=0, to 
see that @ is an algebraic integer such that all its conjugates lie inside the unit 
circle | z| <1, that is, 0 is a P. V. number, q.e.d. 

(The above argument shows also immediately that A is aigebraic, but we 
do not need this result here.) 

3.3. ‘Conversely, if @ is a P. V. number, there exists a number A such that 
the series (3.1.1) converges. Actually we can take \=1. For if o;, o2, , 
are the conjugates of 0, then 0*+o7+ - - - +o7_, is an integer, and if |o;| <6 
<1 fori=1,2,---,k—1, we have 


(3.3.1) lor] <8" 


where 6’ <1, if m is large enough. Hence the series py sin? 76* converges, and 
it converges like a geometric progression. 

This requires (except for the trivial case 6=2) that y(u) does not tend to 
zero for u—>«, as has been proved by Erdés [1]. The proof is immediate 
since the convergence of >>® sin? +6" requires that the infinite product 
[=o cos? +6" converges to a number A 0, unless one of the elements of the 
product vanishes. This is impossible since it would require 0*=(2k+-1)/2 for 
some positive integers m and k, which is in contradiction to the nature of the 
number 6. Hence for p large enough | [? cos? 76*>A/2 and 
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| (0?) | = | cos 70”-cos - - - cos | | cos 7/6 cos --- ) 
> (A/2)"?| cos cos - - | 


and the last infinite product is greater than a positive constant except in the 
trivial case 9=2. (The case 0*=2 with >1 is incompatible with the nature 
of 0.) 

4. Proof of Theorem III. Let now &é be such that 0<£<1/2 and let 
@=1/&. We shall prove that if @ is a P. V. number the set P defined in 1.1 
is a set of the type H,, hence a set of uniqueness. If x belongs to P, we have 
by (1.1.1) 


= — 1)[e:/0 + 62/02? |, 0 or 1. 


Let G be the set of the points y=x/(@—1). This set lies in the interval 
(0, 2x/(@—1) <2) and we have 

Let / be a fixed integer, to be chosen later on. Consider the 2’ possible 
combinations of the h digits did, - - -d, where d;=0 or 1. Let Ci, Cox, + + +, Co fs 
denote these combinations. Writing i 


(4.1) y = + - + + + + | 


consider the sequence of the ¢; belonging to one group of‘terms in parentheses. 
They form one of the combinations C2, - - - , Coa. 

Denote by G,, s=1, 2, - - - , 2*, the subset of G such that if y belongs to G, 
the combination C, occurs in infinitely many groups of terms in parentheses 
when y is written in the form (4.1). Then obviously G=G,+G.+ --- +Gn» 
(some of the G,’s may be empty). The G,’s are not necessarily closed sets. 
But as G is closed we can write, G; being the derived set of G,, 


G = Git+Gi) + + Gy). 


Now let us consider G, for a fixed s. Let m, m2, +--+, Mn, °+- be the infi- 
nite sequence of the ranks of the groups of terms in parentheses where the 
combination C, occurs. Let z be any point belonging to G,. We have 

(4.2) + + + 
+ 2 + +--+ J. 


We can write 


(4.3) 


= a(s) + B(z) + y(z) 


where a(s), 8(z) and y(z) denote the three successive brackets in (4.2). It is 
essential to observe that while 8(z) and y(z) depend on the particular point z, 
the number a(s) depends on s only but not on z. 
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Putting 6*=a,+u, where a, is the integer nearest to 6* and —1/2<y, 
31/2, we have 


B(2) = + + + 2m + + 
= 2nN(z) + 

where N(z) is an integer. On the other hand, @ being a P. V. number, we have 

by (3.3.1) (4 being supposed large enough) 

(4.5) | Br(z) | < + + | = — 8’) 

where &’ is a fixed number, depending on 0, and essentially inferior to 1. We 

have also 

(4.6) 20*™™ = + Zithmn 


(4.4) 


where 

(4.7) | | < < 
Finally, let us remark that y(z) is a point belonging to G,, hence to G. 
Let / be the length of the greatest interval contiguous to G, and let us sup- 


pose that # has been chosen such that the second members of (4.5) and (4.7) 
are both less than //8, say. Then we have by (4.3), (4.4), and (4.6) 


24N(z) + a(s) + y(z) + Bo(z) 


where 82(z) =§1(2) —Zptamn, and so | B2(z)| <1/4. This is equivalent to 
= a(s) + y(z) + Bo(z) (mod 27). 


Let us denote by G? the set of points deduced from G, by multiplication 
by the integer dam, and reduction modulo 27. Apart from the translation a(s) 
which is independent of z, the points of G? are given by the formula y(z) +£2(z). 
As | B2(s)| <1/4, and as y(z) belongs to G (whose greatest contiguous interval 
is of length J) it follows that G? has a contiguous interval of length not less 
than /—21/4=1/2, and this is true for all n. 

Now if we consider, instead of G,, the closed set G,+G; , the multiplication 
by dam, and the reduction modulo 27 will give us the closed set G?+(G?)’ 
where (G})’ denotes the derived set of G}. It is obvious that for every n, 
the set G}+-(G?)’ will have also a contiguous interval of length not less than 
1/2, and this means [Rajchman 1, pp. 403-406] that the set G,+G;/ is a set 
of the type H, hence a closed set of uniqueness. 

Now it is known [Nina Bary 1, p. 79, and Zygmund 1, p. 296] that the 
sum of closed sets of uniqueness is a set of uniqueness, hence the set 
G=>-%,(G,+G/) is a set of uniqueness: indeed, it is a set of the type H,. 

Finally, the given set P is homothetic to G and they both lie in (0, 27). 
Hence, by a theorem of Marcinkiewicz and Zygmund [1], P is a set of unique- 
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ness. Moreover, it follows from a theorem of Nina Bary [3, p. 714] that P 
being homothetic to a set of the type H, is itself of the type H,. 

This completes the proof of Theorem III. 

5. Proof of Theorem IV. In order to prove that there exist P. V. numbers 
of the form 2+e, € being positive and arbitrarily small, we shall use a method 
inspired by the work of Vijayaraghavan [1, p. 350]. Consider the equation 
f(z) =2"—22""!—1=0, n>2. As f(2)=—1, there is a root greater than 2. 
We shall prove that for | s| =1+-7 and for all 7>0 small enough (7 <0) we 
have |2"| <|2z"-!+1|. For let z=pe**. We have |z"|?=p?" and 

| + = 1+ cos (m — 1)w + 4p*-? > — 4p™-1 + 1, 
hence it is sufficient to prove that 
= p™ — 4p*-* + 4p"! — 1 < 0. 
But ¢(1)=0 and (d¢/dp),.1= —2n+4<0. Hence there exists an 70 such that 
if p=1+7 (0<<*) we have | < 22" . This, by Rouché’s theorem, 
proves that f(z) =0 has in the circle =1+7, the same number 
of roots as 22"'+1=0, that is to say »—1 roots. Hence, letting 7—+0, we see 


that f(z) =0 has m—1 roots with moduli not greater than 1. But it is easy to 
see that there are no roots of modulus 1, for that would imply 


ea — 2cos (m — 1)w = 1, 


(S.1) 
sin mw — 2 sin (n — 1)w = 0. 
Squaring and adding, we get 1—4 cos w+4=1, that is to say w=0 which is in- 
compatible with the first of the equations (5.1). 

Hence the real root greater than 2 is a P. V. number. But 


f(2(1 + €)) = 2%(1 + — 4+ — 1 = 2"e(1 — 1 > 2% — 1, 


hence taking «= 1/2" we see that f(2(1+1/2*)) is positive. Hence the P. V. 
number is included between 2 and 2(1+-1/2*) and as is as large as we please, 
this proves the theorem. 

6. Remark. In a previous paper [Salem 2, p. 71] the author has consid- 
ered symmetrical perfect sets Q of order d (d being a positive integer) and of 
constant ratio of dissection, whose points are given by the formula 


= 2n[(a/d)(1 — 8) + +--+ + 


where 0<£<1/(d+1) and the ¢; take all values 0, 1, - - - , d. (The classical 
case considered in the present paper corresponds to d=1.) It has been proved 
(loc. cit. pp. 77-79) that if — is a rational fraction p/g (irreducible), Q is a set 
of multiplicity if #1, and a set of uniqueness if p=1. 

The method used to prove this result, combined with the argument of the 
present paper, proves that Q is a set of uniqueness if and only if 6=1/£ is a 
P. V. number (@>d+1). 
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PROJECTIVE PLANES 


BY 
MARSHALL HALL 


1. Introduction. The Theorem of Desargues has played a central role in 
the study of the foundations of projective geometry. Its validity has been 
shown to be equivalent to the possibility of the introduction of homogeneous 
coordinates from a skew-field, and also to the possibility of imbedding a plane 
in a space of higher dimension. A synthetic proof of the theorem, using a third 
dimension, is relatively easy, but two dimensional proofs failed and it was 
eventually realized that projective planes existed in which the theorem was 
not valid. Non-Desarguesian planes have received attention primarily as 
curious counter-examples, proving the necessity of assuming, along with the 
axioms of connection, either the existence of a third dimension, or axioms of 
congruence('). But the full class of projective planes has a much better claim 
than this for attention. Projective planes are the logical basis for the investi- 
gation of combinatorial analysis, such topics as the Kirkman schoolgirl prob- 
lem and the. Steiner triple systems being interpretable directly as plane 
configurations, various problems in non-associative or nondistributive alge- 
bras, and the problems of universal identities in groups and skew-fields, as 
well as certain problems on structure (lattice) theory and the theory of par- . 
tially ordered sets(?). 

The author has made an attempt to investigate projective planes as sys- i 
tematically as possible. A considerable portion of these investigations is not 
included here either because of the incomplete and unsatisfactory nature of 
the results or because of lack of generality. War duties have forced the post- 
ponement of the completion of this work. The investigations here proceed 
along two main lines. §§2, 3, and 4 are concerned with methods of con- 
structing the most general projective planes, and demonstrating the extent 
of variety which exists in the class of projective planes. In §4, the free planes, 
comparable in certain ways to free groups, are studied. Free planes, generated 
by finite configurations, are characterized (Theorems 4.8 and 4.13) and a 


Presented to the Society, September 10, 1940; received by the editors October 22, 1942. 

. @) Certainly this is Hilbert’s attitude. See D. Hilbert [1]. This reference and all others in 
the paper are to the bibliography at the end of the paper. 

(?) In a recent Colloquium publication it was said “The study of plane projective ge- 
ometries takes us into one of the most mysterious and fascinating parts of combinatory 
analysis—a subject concerning which little is known.” On the Kirkman problem, see T. P. 
Kirkman [1], E. W.. Davis fi ], and the bibliography on the problem, O. Eckenstein [1]. For 
triple systems, see A. Emch [1]. Geometric algebras related to quantum theory are investigated 
in P. Jordan, J. von Neumann, and E. Wigner [1]. F. Levi [1] applies the theory of configura- 
tions to combinatorial topology. 
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canonical set (Theorem 4.12) is determined. As a corollary it follows that 
the subplanes of a free plane are free planes. This analogy with free groups 
does not seem to go so far as to enable us to consider an arbitrary plane as a 
homomorphic image of a free plane. Certainly this cannot be done by a lat- 
tice homomorphism, and no clearly defined homomorphism of partial order- 
ing is determined. If, as a consequence of identifications, we find two different 
points lying on two different lines, we must identify the points or identify 
the lines, but there is no clear reason for preferring one identification to the 
other. This fundamental difficulty has no analogue in the corresponding prob- 
lem for groups. 

§§5 and 6 proceed in a direction different from, but in a sense comple- 
mentary to, the direction of the earlier sections. The earlier sections are 
concerned mainly with the geometry of open configurations, the later ones with 
the geometry of confined configurations. (The definitions of these two terms 
immediately precede and follow Theorem 4.8.) An effort is made to provide 
an algebraic substitute in the general plane for the coordinatizing skew-field 
in Desarguesian planes. Two related systems of coordinates are introduced, 
one a system of permutations (essentially projections), the other a ring de- 
fined by a ternary operation. It is perhaps unfortunate that the permutations 
force two points into a special role and that the ternary ring places a single 
quadrilateral in a special role. Theorems on configurations are associated with 
algebraic relations in these systems. The algebraic consequences of these 
relations are of great importance in characterizing the geometric conclusions 
to be drawn, and enable us to determine the geometric dependence or inde- 
pendence of configuration theorems. Many algebraic problems and new types 
of algebraic systems arise from natural geometric questions, and the fullest 
investigation of these is called for. The same geometric system (such as a web 
or a complete plane) may be associated with two or more non-isomorphic 
algebraic systems, each of which may be constructed from any other by cer- 
tain rules. (See rule (5.12) in Theorem 5.9 and the examples in appendix II.) 
This geometric equivalence, stronger than algebraic isomorphism, is a valu- 
able algebraic tool, comparable to the theory of classes of algebras. 

2. The axioms of projective geometry: The projective plane. Hilbert(*) 
who was as much interested in metric geometry as in projective geometry 
bases his studies of the foundations of geometry on five sets of axioms: 
I. Axioms of connection, II. Axioms of order, III. Axioms of parallels, 
IV. Axioms of congruence, and V. Axioms of continuity. Of these only I 
and III define abstract Euclidean space and the introduction of ideal ele- 
ments at infinity reduces this space to a projective space. Axioms II, IV, and 
particularly V restrict the coordinate skew-field (given by the von Staudt 
construction using I and III) to the real number field. Veblen and Young(‘), 


(*) D. Hilbert [1]. 
(*) O. Veblen and J. W. Young [1]. 
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being concerned solely with projective geometry, proceed somewhat differ- 
ently to the same end. More recently Garrett Birkhoff(*) has defined projec- 
tive geometry as an irreducible complemented modular lattice (comple- 
mented Dedekind structure) and shows the equivalence of his axioms to those 
of Veblen and Young. 

The principal result of these investigations is that the points of a projec- 
tive space of three or more dimensions, which is necessarily Desarguesian, or 
a Desarguesian plane may be coordinatized by the von Staudt construction 
with elements from a skew-field, and the (k—1)-dimensional subspaces of a 
k-dimensional projective space contain those points which satisfy a linear 
equation. The skew-field will be a commutative field if and only if the Theo- 
rem of Pappus holds, while further conditions are required if it is wished to 
restrict the field to the field of real numbers. 

Coming to the structure of projective spaces of less than three dimensions, 
the void space or a point can hardly be said to have any structure, while a 
line is completely determined by the number of points on it. This leaves only 
the non-Desarguesian planes, whose structure is non-trivial, to be charac- 
terized by algebraic means or otherwise. 

Before proceeding to a study of projective planes it seems desirable to 
axiomatize the plane as simply as possible. The following axioms may be used: 


Pi. Any two distinct points are contained in one and only one line. 
P2. Any two distinct lines contain one and only one point in common. 
P3. There exist four points, no three of which are contained in one line. 


The proof that these axioms define a plane in exactly the sense of Birkhoff 
or Veblen and Young is quite simple and will not be given here. P3 serves the 
purpose of axiom EO of Veblen and Young and might easily be replaced by a 
number of equivalent conditions. It seems more elegant to the author to re- 
quire the existence of only four points of a specified kind rather than the 
infinity of conditions in axiom EO. 

The removal of a line and the points on it from a projective plane 7 leaves 
a Euclidean plane +* whose points and lines satisfy the following axioms: 


EP1. Any two distinct points are contained in one and only one line. 

EP2. Two distinct lines contain at most one point in common. 

EP3. The plane contains four points, no three of which are contained in one 
line. 

EP4. Through a point P not contained in a line L there is exactly one line M 
which has no point in common with L. 


From EP4 two lines which are parallel to (that is, do not intersect) a 
third line are parallel to each other. If to r* we add a point of intersection for 


_ @® Garrett Birkhoff [1]. Also K. Menger [1, 2]. These papers achieve independently vir- 
tually the same formulation. 
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each system of parallels and then add a single line containing all these new 
points we obtain a projective plane 7. Clearly the Euclidean plane * deter- 
mines 7 uniquely. 

We may include as planes the systems satisfying the axioms P1 and P2 
but not P3. These are easily found to be 


1. the void plane, 

2. a plane containing a single point and no lines, 

3. a plane containing a single line and no points, 

4. a plane containing a single point and one or more lines through the 
point, 

5. a plane containing a single line and one or more points on the line, 

6. a plane containing a single line and a single point not on the line, 

7. a plane containing a point P on a line Z with one or more additional 
lines through P and one or more additional points on L, 

8. a plane containing a line LZ, and one or more points on L, a point P 
not on L and one or more lines through P joining P to the points of L. 


These systems will be referred to as degenerate or improper planes. Note 
that every plane in which there are enough points on each line contains all 
these types of degenerate planes. In defining proper planes P3 may be re- 
placed by any number of other axioms which exclude these degenerate cases. 

Why is it that projective spaces of three or more dimensions are suffi- 


ciently regular to be coordinatized by skew-fields while projective non- 
Desarguesian planes exist which cannot be so coordinatized? One way of 
answering this fundamental question is the following: In a space configura- 
tion of three or more dimensions the identification of a line, constructed as 
the intersection of two planes, with a line, constructed as the union of two 
points forces the identification of further constructed elements and the es- 
tablishment of non-trivial configurations such as the Desargues configuration, 
while in the plane a line may be constructed only as a union of two points, a 
point as the intersection of two lines and there are no forced identifications. 
Consider a proof of the Theorem of Desargues for the case in which the tri- 
angles are not coplanar. (See Fig. 1.) 

Given: Triangles A,B,C;, A2B2C2, not in the same plane, and A1A>, B; B2, and 
CiC2 passing through the point P. ; 

To prove: That A,B; and AiC; and A2C2, and B,C, intersect in 
points C3, Bs, and As; which are collinear. 


Proof. A:B; and A,B, both lying in the plane PA,B, must intersect in a 
point C;. Similarly A1C, and A2C; intersect in a point B; and B,C; and B,C, 
intersect in a point As. But A;, B;, and C; must lie in both the plane of 
A,B,C, and the plane of A2B,C, and hence on m, the line of intersection of 
these two planes. Hence the lines A3B;, A3C;, and B;C; must coincide with 
this intersection and the points A;, B3;, and C; are collinear. 
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Here the kernel of the proof lies in the identification of AsB3, A3C3, and 
B;C; with m and hence with each other. As we shall see in the next section 
there are no such forced identifications in the plane. 


Fie. 1 


3. The generation of projective planes. Given any set of points and lines 
in a plane, we may obtain further points and lines of the plane by constructing 
the lines joining points of the set and by finding the points of intersection 
of lines of the set. We may repeat this process on the enlarged set and con- 
tinue as often as we please. All points and lines obtainable in this way are 
said to be generated by the given set. For example, all points on a line and 
two points not on the line generate the entire plane in four steps. But obvi- 
ously the entire plane may be generated by the two points and only part of 
the points on the line or in various other ways. The rational plane (rational 
net in Veblen-Young) is generated by any quadrilateral in it. 

We may use the method of generation to construct all possible projective 
planes. 


DEFINITION. A partial plane is any system of points and lines such that: 
pl. There is at most one line through any two distinct points. 
p2. There is at most one point common to two distinct lines. 
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A partial plane is degenerate if it is contained in a degenerate plane. A 
partial plane is proper if it generates a proper plane. This it will do if and only 
if it contains one of the following: 

. four points, no three collinear, 

. four lines, no three copunctal, 

. three non-collinear points and a line not through any one of them, 
. three non-copunctal lines and a point not on any one of them, 

. a triangle of points and lines not sides through two of these, 

6. a triangle of lines and points on two sides of it. 

A representation of a partial plane 7 is a table giving a list of the lines 
(Li, - - - ) of w and listing after each line Z; points (A, - - - ) of contained 
in Z;. Such a representation is said to be explicit if it is to be understood 
that the points and lines are distinct as listed and that no point of lies 
on any line of 7 unless it is so listed. We say that m2 is complete in m if 7172 
and any point of intersection of lines of 72 which lies in 7 also lies in 72 and 
if every line joining two points of 72 which lies in 7; also lies in m2. A plane 
satisfies the axioms of a partial plane and may also be represented as a partial 
plane. A partial plane 7; is said to be isomorphic to a partial plane 7:2 if and 
only if there is a 1-1 correspondence between a representation [L', A'] of m 
and [L?, A?] of 2 such that if A'CL' and L'@L*, A'=A? then A?CL? and 
conversely. Isomorphic partial planes will not in general be considered dis- 
tinct. 

A representation of a partial plane is 


: A2BCs, BC2Asz, 
: AyCiBs, : AsBs, 
: AsCoBs, : ACs, 
A,B,C3, : B,CiAs3, : BsC3. 


If P;, - - - , P, are points no two of which are joined in the representation 
of a partial plane, we may extend the representation formally by adding a 
line L: P,, - - -, P, to the representation, or if Zi, - - -, Z, are lines no two 
of which intersect, we may extend by adjoining a point P to each of 
Ii, ---, L,. The larger table is a representation of a partial plane which 
contains the original partial plane but which now contains a line Z joining 
P,, +--+, P, or in which Z,, - - - , Z, now intersect in P. This method of ex- 
tension may be used to imbed any partial plane in a complete plane. 


THEOREM 3.1. Given a partial plane m, and a representation R, of ™. If ™ 
is degenerate, then any plane with enough points on each line contains m ex- 
plicitly. If m, is nondegenerate, there is a complete plane w generated by m in 
which the representation of m, by R; is explicit. 


(3.1) 


Proof. It has already been noted above that a degenerate plane is con- 


Li: 
Le: 
Ls: 
Le: 
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tained in any plane with enough points on each line. The representation of 
a degenerate plane is necessarily explicit. 

Given a nondegenerate partial plane ; and a representation R, of 71. 
Form sets S;(Pi,, - - -, Pi,) of: points of 2; such that: 


(1) Every pair of points P!, Q' of 2; not joined by a line of 7: is in exactly 
one set S. 
(2) No pair of points P', Q' in a set S is joined by a line of 7. 


We may certainly define such sets in at least one way. We may take each 
set S with only two points in it, and include a point pair P, Q as a set if 
and only if P and Q are not joined in 7;. We now define a partial plane 72 
by a representation R, in which there are points P! and lines L®, L?. P! and L°® 
are the points and lines of R; and P'CL® in R; if and only if P'CL® in Ri. 
For each set S,(Pi,, - - - , Pie) we define a line L? in R, and say that Pi,CLi, 
j=1i,---,r7, and P!(L? otherwise. Then R; is a representation of a partial 
plane 72 containing 7m explicitly. Every pair of points in 7 is joined by a 
line in 72 if not already joined in 7;. Similarly we may extend R; to a represen- 
tation Rs; of a partial plane 7; which contains 7, explicitly and such that 
every pair of lines in 72 intersects in a point of 7; if the pair does not already 
intersect in a point of 72. 

We may continue this process a denumerable number of times. For each 
R2, of a 72, we form sets of two or more nonintersecting lines as was done for 
R; and use these sets to define new intersection points in a representation 
Ronis Of a Ten41 Containing 72, explicitly. For each of a we form sets 
of two or more points which are not collinear in Re,,; and use these sets to 
define new lines in a representation Re,42 containing Re»; explicitly. Every 
incidence of points and lines in a representation R; will occur in a representa- 
tion R;, 7>¢, and no incidences of points and lines of R; will occur in R;, 
j>t, except those already in R;. Requirement (1) of our extensions assures 
us that we do not adjoin two new intersection points to a pair of lines, or 
two new lines through a pair of points. Requirement (2) assures us that we 
do not adjcin a new intersection point for lines which already intersect or a 
new line through points already joined. Hence every R, is a partial rlane. 

Now define a representation R of points and lines P, Z occurring in any 
R,, n=1, 2, ---, and let P‘CL/ in R if and only if P‘CL/ in some R,. This 
determination of incidence is unambiguous, for it is determined in Rmaxc,j; 
the first R, containing both P‘ and L/, and is not altered in any succeeding R,. 
Hence R contains R; explicitly, and satisfies the requirements for a partial 
plane. Every pair of lines of R2, has an intersection either in Re, or in Res41, 
and every pair of points in Re4: is joined by either a line of Re.41 or by a 
line of R242. Hence R is not only a partial plane but is a complete plane. 

It is in the sense of this theorem that it was said in §2 that there are no 
forced identifications in projective planes. For example, (3.1) is a representa- 
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tion of a partial plane which may be extended to a complete plane without 
forcing the identification of Zio, Zu, and Ly as would be necessary in any 
Desarguesian projective space. 

The consequences of this theorem are numerous when the very general 
nature of a partial plane is taken into account. Let us call a subplane r’ of a 
plane 7 a set of points and lines which satisfy P1, 2, 3 or merely P1, 2. Now we 
may take as a partial plane any set of disjoint planes and generate a plane 
containing these as subplanes. 


THEOREM 3.2. An arbitrary set of planes may occur among the subplanes of 
a plane. 


The subplanes of a plane form a structure-lattice if we take the crosscut 
™( \m2 of subplanes 71, 72 to be the points and lines common to both subplanes 
and the union 7\/m, to be the subplane generated by 7 and 72. Here it is 
necessary to include degenerate subplanes. Unfortunately the structure-lat- 
tice of subplanes need not be particularly regular. At least it is very easy to 
construct a plane in which this structure-lattice is not a Dedekind structure 
(modular lattice). Let us take a 7-point plane 7 and a 13-point plane 72 with 
no points or lines in common and let a line of 7 intersect a line of m2 in a 
point P. Now construct a 21-point piane 7; generated by m™ and P and let 
the partial plane of 72 and 73 be extended to a complete plane z (necessarily 
larger than 73). We now have the following diagram from the structure- 
lattice. Here 73m and =m, and the sub- 
plane structure-lattice is certainly not modular. 


O 
Fic. 2 


The great generality of projective planes has so far been demonstrated 
in showing that (1) the configurations in different parts are independent and 
(2) the structure-lattice of subplanes is not greatly restricted, at least in any 
familiar way. 

4. Free extensions and free planes. The simplest way of extending a par- 
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tial plane is to let each new point adjoined be the intersection of only two old 
lines and each new line join only two old points. Such an extension will be 
called a free extension. 

From the method of constructing free extensions we have immediately the 
following theorem: 


THEOREM 4.1. The free extension of a partial plane to a complete plane is 
unique. 
For two free extensions of a partial plane to a complete plane can differ 


only in the order in which new elements are added. Every new element is 
constructed from poirts and lines of the given partial plane in a single way. 


THEOREM 4.2. If 12 is a free extension of m, and 1; is a free extension of 2, 
then 7; ts a free extension of m1. The free extension of m2 to a complete plane ts the 
free extension of ™ to a complete plane. 


THEOREM 4.3. If 71) 72 and m2 is complete in m then the free extension mi 
of ™: contains as a subplane the free extension mi of m2. If m2 is a proper part of ™, 
then m2 is a proper part of mi. 


Since m2 is complete in 7 its extension will include only new elements. 

We may call the free extension of a partial plane 7; to a complete plane 
simply the free extension of 7. The free extension of »—2 points on a line 
and 2 points not on it will be written 7*. 


THEOREM 4.4. The free extension of any non-degenerate partial plane ™ 
which is not a complete plane contains a r* as a subplane. 


Proof. Since ™ is not a complete plane and is nondegenerate, it contains 
either (a) a pair of points P;, P2 not joined by any line or (b) a pair of lines 
I,, Lz which do not intersect. In case (b) suppose first that not every point 
of 7; is on LZ; or Lz. Let such a point be P; and adjoin as P;, the intersection 
of ZL, and Zz. Here P; and P: are two points of a free extension of 7; not 
joined by a line. Second if every point of 7; is on Z; and Lz and there are no 
further lines there must be at least two points each on L, and Ly since 7 is 
nondegenerate. If these are not all joined in pairs by lines take P; and P2 
as a pair not joined. If they are joined in pairs adjoin a point P, as intersec- 
tion of two of these lines not intersecting on LZ; and Lz. Third suppose here P; 
and P, are the points of a free extension 7; and case (b) is reduced to case (a). 

Now take P,; and P2, two points of m (or a free extension of 7) not 
joined by a line. Suppose that m7 contains another pair of points P; and P, 
which are not joined. Then extending 7, if necessary, we have: 


(4.1) Ls: P,P3Bz, Ls: P,P.Bs;, P2P3B3, Ls: B-Bs3. 


This partial plane is necessarily complete in 7; (or 7; extended) since neither 
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P, and P, nor P; and P, are joined. Hence by Theorem 4.3 the free extension 
of 7; contains the free extension of (4.1). But (4.1) is a free extension of the 
four points P;, P2, P3, Ps and hence by Theorem 4.2 the free extension of 7 
contains a 

In any case ™, being nondegenerate, or a simple extension not involving 
joining P, and P2, must contain two points A; and A: which together with 
P; and P; form a quadrilateral. We may now suppose without loss of general- 
ity that 7; is generated by P;, P2, A1, Az and contains 

Ay: Hs: P2A;B3, Hs: 


(4.2) 
He: Az: P.A2B2, Hs: BoB3. 


Here the points and lines are distinct but (4.2) need not be complete in ™ 
since Hs and H, may intersect in ™. Now adjoin Ho: P,P: and the intersec- 
tions of Hy with H; and H, to giving Ho: Pi P2CiCe Hz: A,;A2Ci He: B2BsCo. 
Now C, does not join B; nor does C; join A:. Hence C;, Bz, and C2, Ai may play 
the role of P;, P: and P3, Py in (4.1) and so the free extension of 7 contains 


THEOREM 4.5. A 2", n24, contains a 
Proof. Let * be generated by 
A partial free extension of (4.3) is 
(4.4) Le: Ls: Mz: BiBCxC,, 
Ls: Le: Ans M;: 
Ni: CiCs. 
This contains the partial plane complete in (4.4), 
(4.5) Le: AyAsC3, Ls: Le: Ni: CiCs, 


which is a free extension of 
(4.6) Ni: CiCs3, Le: A3A »An, 


and so x* contains the free extension of (4.5) and hence of (4.6) which is 
a 

Now in (4.4) the triangles A243A, and B,B,B; are perspective with re- 
spect to A; and the intersections of corresponding sides are C2, Cy, and Cz. 
But in the free extension of (4.4) these three points are not collinear. Hence 
am", n=4, is never Desarguesian and in consequence of Theorem 4.4 we have 
the following theorem. 


f 
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THEOREM 4.6. A free extension of a partial plane is never Desarguesian, pro- 
vided the partial plane is nondegenerate and not a complete plane. 


If from a partial plane 7, we remove a point which is on exactly two lines 
of m2 or a line which contains exactly two points we obtain a partial plane ™ 
of which 7: is a free extension. This removal, or any succession of such re- 
movals, will be called a free contraction. If one partial plane may be obtained 
from another by any succession of free extensions or free contractions we say 
that the two partial planes are free-equivalent. It is easily shown that this is a 
true equivalence, being reflexive, symmetric, and transitive. 


THEOREM 4.7. A free contraction followed by a free extension may be replaced 
by a free extension followed by a free contraction. 


Proof. It is sufficient to prove this when there is only one step in the con- 
traction and one in the extension. If we remove a point P and adjoin a line L 
we may first adjoin Z and then remove P, since P was not used to define L, 
and the two lines which intersected in P have not been altered, nor does L 
pass through P. If we remove a point P and adjoin a point 0#P, we may 
first adjoin Q and then remove P, since the two lines for which Q is an inter- 
section do not intersect before the removal of P. If we remove P and then 
adjoin Q=P, we may simply omit both steps or adjoin any element and then 
remove it. The situation is similar if the element first removed is a line. 

A finite partial plane which has three or more points on every line and 
three or more lines through every point is said to be confined. Free contraction 
is not possible in a confined partial plane, or confined configuration as we 
may call it. 


THEOREM 4.8. A free plane does not contain any confined configuration. 


Proof. The generating partial plane (4.3) of a free plane 7* does not con- 
tain any confined configuration since there are only two lines. Hence a con- 
fined configuration a must contain elements of later stages. Consider an 
element of the last stage occurring in a. Suppose it to be a line L of the ith 
stage. The points on it must be of earlier stages. But in a free plane an ith 
stage line contains only two points of earlier stages, namely those defining it, 
and we have acontradiction. Similarly if an element of the last stage in aisa 
point we have a contradiction. Hence 7 contains no confined configuration. 

We shall call a finite partial plane which contains no confined configura- 
tion an open configuration. Here is an example of an open configuration in 
which free contraction is not possible, but which becomes quite simple under 
free equivalence. 

In: A;sA4Bsz, Ls: 


(4.7) Iy: AsAsBs, Ls: 
Li: 


qj 
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By free extension 

Iy: A2A3B3D,DD3, Li: B2BsCs, 

Ia: A;A3Bz, Ls: M,: AC,D,, 

Is: M2: A;C2D2, 
Ms: A 1CaD3. 


(4.8) 


By free contraction we may remove first Ci, C2, C3 then Mi, Mo, M3, Li, Le, Li 
and then Bs; and and leaving 


(4.9) Ls: A 1A 4) La: AoA 3D,D2D3, 
the open configuration generating 7’. The general problem of free equivalence 
is treated in the following theorem. 


THEOREM 4.9. A nondegenerate open configuration is free equivalent to a 
configuration (4.3) which generates a x”. 

Proof. Given an open configuration which is nondegenerate. Extending, 
if necessary, we may find a quadrilateral A1, A2, As, Ag and hence by free 
equivalence we will have a configuration: 


(4.10) Le: A3A4(C)) 


where the intersection C; of L; and Lz may or may not occur, and the points 
B,, - - - , B, are on no line. We shall reduce the configuration in various ways 
by free equivalence, by reducing the number of lines, or keeping the number 
of lines fixed, always retaining two lines such as Z; and Lz containing a quad- 
rilateral. This reduction process must come to an end in a finite number of 
steps and we shall show that the end is not reached as long as there are three 
or more lines, and then treat the cases with two lines. 

First if there is a line Z with no points, intersect ZL with LZ; and Lz and 
then remove L. If there is a line L with exactly one point not equal to 
A, Az, Ci, intersect L with ZL; and then remove L. If there is a line L with 
exactly one point, either A; or Az, intersect L with Zz, and then remove L. 
If there is a line L with exactly the one point C; adjoin M: A1A; if necessary, 
intersect with L and then remove L. Naturally we remove any lineLs, - - - ,L, 
with exactly two points. Hence we may suppose that either a reduction is 
possible or that each of the lines Ls, - - - , ZL, contains at least three points. 
Now if some line, say Zs, contains a point As#A1, Ax, As, Ag which occurs 
exactly once, then Z; cannot contain both A; and Az, and say it does not con- 
tain A;. Then M: AA; will be a new line. By free extension (4.10) becomes 
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A,A2(C)) --- [Bi,---, Bl, 
(4.11) : A3A,(C;)--- Di, 
AsAe:--, , : AjA5D,. 


By free contraction it becomes 
Ty: A,A2(C)) B,|, 


(4.12) A3A4(C;) D,, 
Ls: Ag-:+, ,Ly: 


Hence a point A; occurring exactly once on L; may be replaced by a point 
occurring exactly once on Lz (or similarly on Z;). Since this process is clearly 
reversible, we may shift the points of single occurrence at will and we shall 
suppose that they are all on one line, either Z; or Lz according to preference. 
Note that this does not involve any increase in the number of points. We 
remove any point occurring exactly twice, except one of A1, As, As3, As. If 
one of these occurs exactly twice, suppose this to be A; occurring a second 
time on Z; and that the points of single occurrence are on Lz. Now Ls; cannot 
contain C; and if it contains neither A; nor A,, then Ls: A1AsA4¢6--- where 
As, Au, As, Ae form a quadrilateral and we may reduce by dropping A; and 
keeping Le, Ls in the role of Z;, Ze. If Ls contains one of A3, A, say Az, then 
L3: A1A3As ~:~ and this reduction fails. Here if C,; occurs, then in its third 
occurrence L4: C:A¢A7 - - - and we may take A3, Aa, As, Az as a quadrilateral 
on Lz and LZ, and reduce by dropping A. If C; does not occur, then As occurs 
on at least two lines besides LZ; and at least one of these is not through A,, 
say Ls: AsA¢ - - -. Here we may take As, A, As, Ae as a quadrilateral and 
drop A, for a reduction. Hence if there are more than two lines and (4.10) 
is not reducible by one of these devices, we may suppose that every point 
occurs either once or at least three times, that the points of single occurrence 
are on L, or Le, and that each of Ls, - - - , Z, contains at least three points. 
Since there are at least three points on ZL; and since each of these occurs at 
least three times, there must be a fourth line to yield enough incidences. 
Now since ZL; and J, have three or more points each, points occurring three 
or more times must contain a quadrilateral of points each occurring three or 
more times. Hence shifting from A1, A2, As, A, to this quadrilateral, if neces- 
sary, we may assume that A1, As, A3, A, each occurs at least three times in 
(4.10) or that further reduction is possible. Now if in (4.10) we suppress all 
points of single occurrence, the remaining configuration must, by hypothesis, 
be open. But this leaves Zs, - - - , ZL, with at least three points each, Z; with 
at least A;, Az and LZ» with at least A3, A, and every point occurring three or 
more times. Hence one of Z;, Le, say Zi, can have no more than the two 
points A;, As. Now in (4.10), without suppression of points of single occur- 
rence, the single points may be supposed shifted to Le. Since there are no 
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points of double occurrence, this leaves Z; with exactly the two points A1, As. 
Ls; and JZ, will certainly contain a quadrilateral and so we may reduce by 
dropping Z; and still keep two lines with a quadrilateral. This completes re- 
duction of (4.10) by free equivalence to a configuration with two lines con- 
taining a quadrilateral, 

(4.13) qi: [B:, B, |, Le: A3A,4 eee 
where C; has been removed if it occurred. By free extension 

(4 14) Li: B,|, Ls: A;ByA u41, 

Te: + Ly: u42- 


By free contraction we remove B, and then L3, Ly, yielding 
(4.15) Ly: [Bo,---, Bel, Lz: 
Thus each B may be replaced by two A’s on LZ, until we have 
(4.16) AsA2D,--- Ds, Av. 
Now we may use a free extension 
Li: Di, L3: 


Ls: DrEyA 


where by free contraction we may remove D,, then Ls, then E,, and finally 
L; and J, leaving 


(4.18) Ty: Iz: A 


In this way we may replace every D by another A on Ly» reaching 
(4.19) Li: Le: 


which we may call the free -point. This completes the proof of the theorem. 
In any configuration let P be the number of points, Z the number of lines, 
and J the number of incidences. 


THEOREM 4.10. The number 2(P+-L) —I is invariant under free equivalence. 


Proof. In free extension, adding a point we increase P by 1 and J by 2. 
Adding a line, we increase L by 1 and J by 2. In free contraction we decrease 
by similar amounts. In all cases 2(P+L)—J remains unchanged. 


THEOREM 4.11. Am open configuration a which is nondegenerate is free 
equivalent to the free n-point if and only if 
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Proof. By Theorem 4.9, a is free-equivalent to some free m-point and by 
Theorem 4.10, 2(P +L) —J is invariant under free equivalence. Now for (4.19) 
P=n, L=2, Il=n and 2(P+L) —I=n+4. 


THEOREM 4.12. A free n-point and a free m-point are not free equivalent if 
The planes and are not isomorphic. 


An immediate consequence of Theorems 4.10 and 4.11 is that the free 
m-point and the free -point are not free equivalent if mn. If 7” and 2” 
were isomorphic we could consider both as the same plane generated in one 
way by a free m-point and in another by a free m-point. Let C, be a configura- 
tion generated by the free m-point large enough to include the generating free 
n-point. Then extend C; by generating from the free n-point so that the gen- 
erating free m-point is constructible from the free m-point. Call the extended 
configuration C;. Now from its method of construction C,; may be considered 
as generated either by the free m-point or by the free n-point. By assumption 
the whole plane is the free extension of both the m-point and the 1-point. 
Hence by free contraction C; is free-equivalent to both the m-point and the 
n-point, and they are free-equivalent to each other. But this is a contradiction 
and hence x” and * cannot be isomorphic. 


THEOREM 4.13. A plane containing no confined configuration and generated 
by a finite configuration is a free plane x". 


CoroLiary. The finitely generated subplanes of a free plane are free. 


Proof. Let 7 contain no confined configurations, and be generated by a 
finite configuration a. In the extension of a the free adjunction of a point or 
line leaves 2(P+L)—TI invariant, while the adjunction of a point on three 
or more lines or a line through three or more points decreases the value of 
2(P+L)—I. Now a configuration for which 2(P+L)—IsS7 cannot by Theo- 
rem 4,11 be an open configuration. Hence in any (finite) extension of a in 7, 
2(P+L)—Z28 and consequently in generating 7 from a@ there are only a 
finite number of adjunctions which are not free. Hence 7 may be considered 
the free extension of some finite extension 8 of a. But 8 is an open configura- 
tion and hence a finite extension y of 8 is a free extension of a free n-point 
and hence 7, as a free extension of , is a free plane x”. 

It is conceivable that Theorem 4.13 would not be true without the assump- 
tion of the finiteness of a. It is possible that the generators of a plane might be 
so scattered and so interrelated that it might be impossible to find points on 
a single line and two points off it which would generate the entire plane with- 
out any relation between them. 

It is easy to prcve by the method of Theorem 4.8 that a free extension 
of a partial plane contains no confined configuration except those included 
entirely in the partial plane. Thus any collineation or duality of the free 
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extension of a confined configuration maps the confined configuration into 
itself. In this way it is not too difficult to construct planes which are not self- 
dual, planes with only the identity as a collineation, and so on. Further in- 
vestigation along these lines might be rewarding. 

5. Introduction of coordinates. Classical investigations(*) have shown that 
in a Desarguesian plane we may introduce homogeneous coordinates from a 
skew-field and thereafter proceed by the methods of analytic geometry. For 
a given skew-field F, we denote points by P(Ax1, Axe, Ax3), X1, X2, Xs not all 
zero, \¥0 arbitrary. We denote lines by L[aic, azo, a3], a1, a2, @3 not all zero, 
and ¢#0 arbitrary. Here PCL if and only if 


(5.1) 410, + + = 0. 


Given any skew-field F it is not difficult to verify that points and lines so 
defined form a Desarguesian plane. We shall distinguish between an analytic 
plane and a Desarguesian plane until the equivalence has been shown here. 
It is not the purpose of this paper to duplicate this classic result, but it will 
be incidental to other investigations. 

In an analytic plane it is easy to verify that the projective transformations 


px, = + + bisx3 , 
(5.2) px2 = + + 
+ bsoxd + bssxs, 


with p#0 and the matrix (0;;) of rank three, form a group of collineations and 
that by an appropriate collineation the vertices of any quadrilateral may be 
taken into A(0, 1, 0), B(1, 0, 0), C(O, 0, 1), and D(1, 1, 1). In the following 
section we shall select an arbitrary quadrilateral in any projective plane + 
and introduce coordinates with respect to which it plays a role similar to that 
of ABCD in the analytic plane. 

Let us select any two distinct points A and B in the projective plane z. 
If from 7 we remove the line L,, joining A and B and the points on L,, the 
remaining plane 7* is a Euclidean plane. We shall use A and B as centers of 
perspectivities to introduce coordinates for the points of r*. No ambiguity 
will arise if, as we shall find convenient on occasion, we speak of r and 7* 
as the same plane so long as L,, is fixed. 

Let the lines of the pencil through A in r* be denoted by x =%o, - - -,X=Xn-1 
where ” is their cardinal number and %o, - - - , X,-1 are m different symbols. 
Similarly denote the lines of the pencil through B by y=yo, -- +, y=¥n-1. 
(The cardinal number of the y’s is necessarily the same as that of the x’s.) 
Then through every point P of x* there is exactly one line x = x; and one line 
y =y; Denote P by (x;, ys). As in Figure 3 we see that on an arbitrary line L 
of z* not through A or B there are points (x;, y;) where each x and each y 


(®) D. Hilbert [1], K. G. C. von Staudt [1], vol. 2, pp. 166 et seq. 
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occurs exactly once. Henceforward suppose that the symbols xo, «+ + , Xn—1 
are the same as yo, - - - , Yn, though not necessarily in the same order. Then 
an arbitrary line Z not through A or B is associated with the permutation 
(2. 85) where the (x;, ys) are the points of L. This expresses the fact (see 
Figure 3) that Z determines a 1-1 correspondence between the lines of the 
pencil through A and the lines of the pencil through B. Without confusion 
we may write 


(5.3) 
since distinct lines contain at most one point in common and hence are asso- 
ciated with distinct permutations. 


A 


(x1, y1) 


Fic. 3 


THEOREM 5.1. The projection L—,4M-—.sL induces the permutation 
(xi, vs) —>(xj, ys) Of the points of L where = ML-. 


Proof. The perspectivity L-,M takes (x;, y;) of L into (x;, 2;) of M and 
the perspectivity M—2L takes (x;, 2;) of M into (x;, y;) of L where y;=2;. 
But and where and hence ML-! 


soe, 


We have shown that in an arbitrary Euclidean plane 7* the points may 
be represented by (x;, y;) and the lines by x =c;, y=c; and permutations L 
of m symbols ¢o, - - - , Cn-1. What are the properties that characterize the set S 
of permutations [L]? This question is answered by the following theorem. 


THEOREM 5.2. A set S of permutations on n letters co, - ++, Cas will be 
associated with the lines of a Euclidean plane x* not through its centers of per- 
spectivity if and only if (a), (b), and (c) or (c’) hold. 


— 
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(a) S is doubly transitive on Co, , Cn—1. 

(b) If M and L are different permutations of S, then ML— fixes at most one 
letter. 

(c) If L does not take c; into c;there is one and only one permutation M which 
does take c; into cy such that ML displaces all letters. 

(c’) nis finite. 


REMARK. Note that an interchange of the centers of perspectivity, A 
and B, replaces S by S-!, the set of inverses of elements in S, and that S-! 
satisfies conditions (a), (b), and (c) if and only if S satisfies them. 

Proof. In this proof we shall use conditions (a), (b), and (c). See appendix I 
for a proof that (a), (b), and (c’) imply (c) and also for counter-examples 
showing that when 2 is infinite (a) and (b) do not imply (c). 

Suppose S is the set of permutations associated with lines L not through 
A or B in a Euclidean plane x*. If x1, x2 are two different symbols and 91, 72 
are two different symbols then there is a line Z through (x1, y1) and (x2, ys), 
and since x1, and y:+y2 this line does not pass through A or B. Hence 
L=(j7":) and (a) is satisfied. Now if M and L are two different lines ML— 
fixes a letter c; if and only if M and L both take ¢; into the same letter, say c;, 
whence M and L have the point (c;, cj) in common. Since M and L have at 
most one point in common ML-' can fix at most one letter. Hence (b) is 
satisfied. Now if Z does not take c; into c;, that is if (c;, c;) is not on L, then 
there is exactly one line M parallel to L through (c;, cj), whence M takes c; 
into c;and ML- displaces all letters. Hence (c) is satisfied. 

Conversely suppose S satisfies conditions (a), (b), and (c). Construct 
points (xi, yj) and lines x=c;, and with LZ from S. Then 
the points and lines of this system satisfy the axioms for a Euclidean plane z*. 
Using (a) we may verify that there is a line through two arbitrary points. 
From (b) we may show that two lines intersect in at most one point. From 
(c) it follows that through a given external point there is exactly one line 
parallel to a given line. In the verification of these axioms, the lines x =¢; 
and y=c; form two sets of parallels and assure us that there is a line through 
(x1, yx) and (x2, even if x1=%2 or yi= Jo. 

In a plane any two lines contain the same number of points (here » for 
the Euclidean plane, »+1 for the projective plane). Hence if two planes are 
isomorphic the number of points per line must be the same for both. Since 
in Theorem 5.2 the symbols ¢o, - - - , ¢,-1 are any » distinct symbols, without 
loss of generality we may assume that the same symbols are used in repre- 
senting any two planes which have the same number of points on a line. 


THEOREM 5.3. The plane m, represented on the pencils through A; and By, 
using symbols Co, , Will be isomorphic to the plane m2 represented on the 
pencils through Az and Bz, using symbols co, + , Cn-1, With and 
if and only if there exist two permutations X and Y of Co, +++, Cn—1 Such that 


247 


1943] PROJECTIVE PLANES 
X-1S,Y =S, where S; is the set of L permutations of 3, and Sz is the set of L 
permutations of m2. Here X =(c:%'":) where the line x =c; of the A, pencil is 
mapped onto the line x=c; of the Az pencil, and Y=(°%°::) where the line 
y =c, of the B, pencil is mapped onto the line y=, of the Bz pencil. 


Proof. Suppose there is an isomorphism 7;<272, Ai1=A2, B;=B:. Then the 
pencil through A; is mapped onto the pencil through A», and that through 
B, mapped onto that through B;. Hence correspondences x =¢;=2x =c; and 
are determined. Write Y= Then 
Pi(ci, Here if and then Hence if the 
permutation takes c; into c, then X—'Z, Y takes c;into c, whence Lz = X-1L, VY 
as was to be shown. Conversely suppose that X—4.S, Y= S;. Then let us set up 
the correspondences A1=A2, x =c;, y=c,2y =c, and c,) 
=P,(c;, c,). Also put if Lz = X-1L, Y. Here PiCLi if and only if P2CLs. 
Hence this is an isomorphism between m and 72, which maps A; onto A: 
and B, onto By. 

We may use the permutations S associated with a plane @ to introduce as 
another representation of w a ternary ring. This ternary ring, as will be proved 
later, reduces to the usual coordinate ring in an analytic plane, but, without 
conditions of Desarguesian type, it will have very few of the ordinary ring 
properties of a skew-field. Among the various equivalent representations of + 
which Theorems 5.2 and 5.3 yield we may, by taking X=L, Y=TI, obtain 
a representation in ‘which an arbitrary line L@Z-'LI=TI is represented by 
the identity. Now take two arbitrary symbols as co=0, and c,=1. We 
keep unchanged the representation of the lines x =c;, y=c;. The remain- 
ing lines M we use to define a ternary operation y=x-m o b on the symbols 
0, 1, ce, + * * , Cas. To a given line M correspond parameters m, b in the fol- 
lowing way: 

(1) If M=({}"::), then for M take m=s, b=0. 

(2) If M goes through (0, a), a0, and is parallel to ($:}':::) then for M 
take m=s, b=a. If (x, y) is any point on M define the ternary operation as 
y=x-m o b. Here a point is represented by a pair of coordinates (x, y) and a 
line contains all those points bound by the ternary relation y=x-m o b. This 
operation is defined for all x, m, 6 selected from the system 0, 1, cz, + ++ , Cn1 
except for triples with m =0. We may extend slightly by defining x-0 0 b=b for 
any x and thus include the lines y=c; in the general form. In the analytic 
planes (0, 5) is the intercept of M on the y-axis, m is the slope of M, and the 
equation of M is y=xm-+-b. What in general are the properties of this ternary 
operation and when does such a ternary operation define a plane? 


THEOREM 5.4(7). A ternary operation a-b o c uniquely defined for all ordered 
triples a, b, c containing elements 0 and 1 may be derived from the incidences of a 


(7) Similar results, as yet unpublished, have been obtained by D. T. Perkins. 
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Euclidean plane having the following properties: 

(1) 0-moc=a-0o0c=c, 1-mo0=m, a:100=a. 

(2) x-mob=c, for given m~0, b, c, has a unique solution x. 

(3) a-moz=c, for given a, m, c, has a unique solution z. 

(4) a-z0b=c, for given aX¥0, b, c, has a unique solution z. 

(5) x-m,0b,;=x- mz 0 by has a unique solution x if 

(6) The pair of equations 0 b=y,, x2:m 0 has a unique solution 
m=s, b=a given x1A~x2, 

For each choice of a fundamental quadrilateral A, B, O=(0,0), I=(1, 1) ina 
the ternary operation is uniquely determined. Conversely a ternary operation on 
a system T with the above properties defines a Euclidean plane with points (a, b), 
a, bCT, and lines x =c and y=x-mob. 


Proof. As was shown above, a plane may be represented as in Theorem 5.2, 
taking one line Z as the identity, the coordinates being from a system 
T(O, 1, - , The points are (c;, c;) and the lines are x =¢;, y=x-mob. 
Here a-00c=c is part of the definition of the ternary operation and 
0-m o c=¢ states that (0, c) is on y=x-m oc. 1-m 0 states that (1, m) 
is on (9°22) if m#¥0 while 1-000=0 is a special case of a-O0oc=c. 
a-100=a states that ((}':::) is the identical permutation. So much for 
property (1). (2) states that y=c intersects (;':::) in exactly one point. In 
(3) if m=0,a:-moz=z=c and z is unique. If m0 it states that there is a 
unique parallel to (9...) through (a, c). (4) states that if a~0 and b=c 
then z=0 is a solution, whereas if a-m ob=c with m#0 we have (}?°:: 
parallel to ('m':.), hence c#b. If bc there is a unique parallel (¢°: 
through (0, 0) to (}%°:), and z=m is the unique solution. (5) states that 
the lines y=x-m, 0 b; and y=x-mz Oo be, if m:# me, are not parallel and inter- 
sect in a unique point (x, y). (6) states that there is a unique line through 
(x1, yi) and (x2, ye) if x1%x2, y1X%y2. Hence the ternary operation defined by a 
plane has properties (1) to (6) inclusive. 

Conversely suppose that in a system 7(0, 1, ¢2, - - - , n_,) a ternary opera- 
tion a:b oc is defined satisfying properties (1) to (6). Let us define points 
(ci, cj) and lines x=c;, y=cy=x-Ooc;, and lines M(m, if 
cj=c;-mo b for every m0, and.every b. Since x-m o 6 is uniquely defined for 
every x in M,c; takes on every value exactly once. From property (2) every c; 
occurs exactly once in M and so M is a permutation of the c’s. From property 
(6) the permutations M are doubly transitive. From property (5) M,Mz" fixes 
exactly one letter if the slopes m, and mz, are different. If m;=mz_=m, b1~ bz, 
x-m 0 b\x¢x-m O by since from property (3) a-m o z=c has a unique solution, 
and in this case M, M7" fixes no letter. If m; = mz and 6; = bz, the lines M, and M2 
are identical. From the definition of the ternary operation and property (2) 
it follows that no line y=x-m o b with m0 can be parallel to a line x =c 
or y=c. Hence through an external point there is exactly one parallel to a 
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line of slope m whose slope is m (property 3), and no parallel with a different 
slope. Hence the conditions (a), (b), (c) of Theorem 5.2 are all satisfied and 
the ternary relation does in fact define a plane. In addition, one of the lines, 
y =x-100, is the identity y =x. Note that in this coordinatization of the plane 
not only A and B, but also (0, 0) and (1, 1) play a special role and determine 
from the plane the ternary operation to within isomorphism. Different choices 
of the fundamental quadrilateral may lead to systems not isomorphic in any 
algebraic sense. For examples, see appendix ITI. 

We may define both addition and multiplication as special cases of the 
ternary operation. 

Addition: a+ =c if and only if a-10b=c. 

Multiplication: ab =c if and only if a-bo0=c. Here we have 0+a =a+0=a 
and the existence of unique u, v such that u+-a=), a+v=5, though not even 
the right and left negatives of a need be equal. Also a0 = 0a =0 and ai = la=a, 
while for a#0, ua =}, av =b have unique solutions, though not even the right 
and left inverses need be the same. In general no other ring properties hold, 
nor will it be true that x-m o b=xm-+b. 

In an affine analytic plane lines y=xm-+5 with m0 may be associated 
with the transformations x-xm-+5 of the linear group. Here the permuta- 
tions (<::13°:::) will be the elements of the linear group. Hence when skew- 
field coordinates have been introduced, the permutations of Theorem 5.2 form 
a group. But it is possible for these permutations to form a group even in 
certain non-analytic planes. See appendix II for an example of such a plane. 


THEOREM 5.5. If the Ai—B, permutations of a plane m form a group G, 
the Az—Bz permutations of a plane m2 isomorphic to with 
form a coset of a transform of G in the symmetric group on Co, - - + , Cn—r- 


Proof. The permutations 5S; of 7 are the group G. By Theorem 5.3 the 
permutations of m2 are S;=X~-1S,Y for appropriate permutations X and Y. 
Hence S:=X—'GY=X-'GX -X-'Y is a left coset of X-'GX, a transform of G. 
Also S:=X-!Y- Y—'GY is a right coset of Y"GY. 


THEOREM 5.6(°). A permutation group G on letters co, - - - , Cn—1 Such that 

(a) G is doubly transitive, 

(8) only the identity fixes two letters, 

(y) at most one element taking c; into c; displaces all letters, or 

(y’) nis finite 
will be the permutations S of a Euclidean plane x*. The identity and the elements 
of G which displace all letters form a simply transitive normal abelian sub- 
group H. 

(®) When 1 is finite, this theorem rests on a theorem of Frobenius, A. Speiser [1, p. 200], 


depending on the theory of group characters, or on the Sylow theorems as proved in R. D. 
Carmichael [1, p. 145]. Compare Lemma 8 with page 25 in G. Thomsen [1]. 
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Proof. In a group, clearly conditions (a) and (8) are equivalent to condi- 
tions (a) and (b) of Theorem 5.2. As is shown in appendix I, (a), (b) and (7’) 
imply (c). When 1 is infinite condition (7) is only half of condition (c). It is 
the “only one” part of the condition. We shall see in Lemma 5 below that the 
part “there exists one” of condition (c) is a consequence of (a) and (8) for a 
group G. The second counter-example in appendix II shows that this is not a 
consequence of conditions (a) and (b). The author does not know whether (a) 
and (8) for a group G imply (y). A counter-example, if it exists, would have no 
particular geometric interpretation and would be of relatively little interest 
here. When Lemma 5 has been reached, we shall have shown that the permu- 
tations of G are the permutations S of a plane. 


LemMaA 1. There exists one and only one element of order two in G which 
interchanges a pair of letters (i, 7). 


Since G is doubly transitive there exists an element g=("77::}). If h 
also interchanges and j then fixes both and whence gh-!=1, g=h. 
Hence g is unique. Also g? fixes both 7 and j and hence g?=1. 


LEMMA 2. The elements of order 2 in G are a conjugate class. 


An element of order 2 fixes at most one letter. If 2 =2 there is only one ele- 
ment of order 2 and the lemma holds. If 23 then g and & of order 2 must 
both displace some letter 4; g= (ij) - - - , h=(tk) - - -. G contains an element 


x=(Pi") and x—'gx=(ik) --- =h. If an element fixes a letter, then any 
conjugate will also fix some letter. Hence we may subdivide into two cases: 
Case I, The elements of order 2 displace all letters; Case II, Every element of 
order 2 fixes a letter. 


LEMMA 3. In Case II there is one and only one element of order 2 fixing a 
given letter. 


In Case II, as before, g=(ij) --~- is transformed into h=(ik)--- by 
x=(¢2£:). But here if g and h both fix the letter s, then x must also fix s. 
Hence x #1 fixes both ¢ and s which contradicts assumption (8). If g fixes a 
letter s and ¢ is any other letter, an x taking s into ¢ will transform g into an h 
of order 2 which fixes ¢. 


Lemma 4. The product of two different elements of order 2 is an element dis- 
placing all letters. 


Suppose to the contrary that g?=1, h*=1, gh, and gh fixes the letter i. 
If either g or / fixes ¢ then the other must also, but this is impossible by 
Lemma 3. If both g and h displace 7, then the letter j into which g takes i must 
be taken into ¢ by h and g=(ij) - - - , A=(ji), whence by Lemma 1 g=h, 
contrary to assumption. Hence it is impossible that gh could fix any letter. 


4 
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Lemma 5. In G there ts at least one element, displacing all letters, which takes 
given i into a given j 

In Case I there is an element of order 2, displacing all letters, which takes 
4 into 7. In Case II there is an element g of order 2 fixing 7, and an element h 
of order 2 taking 7 into j. Here gh takes 7 into 7 and by Lemma 4 displaces all 
letters. 

As yet we have made no use of assumption (vy). From now on it is neces- 
sary to use it. From Lemma 5 and (v7) there is exactly one element in G, dis- 
placing all letters, taking ¢ into 7. Now if L is an element taking k into j, 
k ¥i, then let T be the element displacing all letters taking ¢ into k. Then take 
M=TL. Here M takes i into 7 and ML-!=T displaces all letters. Since T 
is unique, M also is unique. This proves property (c). Hence the elements of G 
form the permutations S of a Euclidean plane z*. 


LemMA 6. In Case I every element displacing all letters is of order 2 and these 
together with the identity form a normal abelian subgroup. 


Since there is an element of order 2, displacing all letters, taking 7 into j, 
this must by (7) be the only element, displacing all letters, taking i into j. 
But since ¢ and j are arbitrary, every element displacing all letters must be 
of order 2. If g?=1, h*?=1, then if g=h, gh=1, while if gh, gh displaces all 
letters and hence (gh)?=1, whence hg=gh, and the elements of order 2 to- 
gether with the identity form an abelian subgroup H. By Lemma 2, H is a 
normal subgroup. This completes the proof of the theorem for Case I. 


Lemma 7. In Case II an element g displacing all letters may be written as 
the product g=ab of two elements of order 2 where either a or b may be chosen 
arbitrarily. 

Given g displacing all letters and a¥1, a?=1. Suppose that a fixes the 
letter ¢ and that g takes 7 into 7. Let b=(ij) - - - . Then ad takes 7 into j and 
by Lemma 4 displaces all letters, whence by (vy), g=ab. Similarly, given g 
and 6 fixing k, if g takes 7 into k and a=(tk) - - - , then g=ab. 


Lemma 8. In Case II the product abc of three elements of order 2 1s again an 
element of order 2, and abc =cba. 


Here the lemma is trivial if b=a. If ba, then ab=g=dc where, by 
Lemma 7, d?=1. Hence abc = dc? =d. Since d =d-!, abc = cba. 


LemMA 9. In Case II the elements displacing all letters, together with the 
identity, form a normal abelian simply transitive subgroup H. 


By Lemmas 4 and 7 these are the elements of the form ab where a and b 
are of order 2, and hence are a normal set. By Lemma 5 this set is transitive. 
Clearly the inverse of ab is ba. To show that these form an abelian subgroup 


| 
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we must show gig2=1 or gs and gige = gog:. Here gige = a1b1d2b2 and by Lemma 7 
gi=da2, whence gige=db,=1 or g3. By Lemma 8 gigs = (a1b1d2)b2 = (a2b101) be 
= d2(bi1d1b2) = d2(b2a1b1) = gogi. This completes the proof of Theorem 5.6. 


THEOREM 5.7. Under the assumptions of Theorem 5.6 in x* we have 
x:-mob=xm-+b and addition and multiplication satisfy: 

Al. (a+b)+c=a+(b+0), 

A2. b+a=a+b, 

A3. a+0=0+a=a, 

A4. a+(—a) =(—a)+a=0, 

M1. (ab)c=a(bc), 


M3. ai =1a=a, 
M4. aa-!=a-‘a = 1 if a #0, 
MS. a0=0a=0, 


D1. (a+b)m=am+bm, 
E3. xr=xs+t has a unique solution x if r#s. 


Proof. Following Theorem 5.4 the identity is J=($}':). If 
Ag=(S2%") is the permutation taking 0 into a and displacing all 
letters then v=u-1oa=u-+a. If then s=w-m o 0=wm. 
An arbitrary permutation may be written in the form L=($}"":) 
= = Mn Ae whence x-m o b=xm-+b. 

From Theorem 5.6-the A,’s and J form an abelian subgroup H whence 
Al, A2, A3, and A4 hold. The M,,’s and J form the subgroup of G fixing the 
letter O, whence multiplication forms a group and Mi, M3, M4 hold. To 
show that the one-sided distributive law D1 holds we make use of the fact 
that H is a normal subgroup in G. Let Aa=(2"%), As=(3204%,72), 
Here since all letters 
are displaced. Also Mir!AsMn=Asm. Hence 
= =A amA tm =A am+bm, whence (a+b)m 
=am-+bm. The other distributive law a(r+s)=ar+as will not in general 
be true. The law E3 is an immediate consequence of property (5) of Theorem 
5.4 and the fact that x-m o b=xm-+5 in this theorem. 

We may modify the notation for points and lines in r* and adjcin L, 
and the points on it to obtain a pseudo-homogeneous representation for the 
projective plane 7. Let P*=(x, y) in x* be written P=(x, —y, 1) in m. If 
the line y=xm+) in x* cuts L,, in 7 in the point Q, write Q@=(1, —m, 0) in z. 
Finally write A =(0, 1, 0). Then the lines y=xm-+5 in r* become y+xm-+2b 
=0 containing points (x, y, z) inw. Also write x =c in r* as x+2(—c)=Oinz, 
and finally L,, is z=0. The associative law of multiplication and D1 permit 
us to take any equation ya+xb+2c =0 as the equation of a line, but without 
the distributive law D2: a(6+-c) =ab+<ac we cannot write the points in homo- 
geneous form. The associative law of multiplication, M1, is not necessary in 
order that a ring satisfying the remaining axioms of Theorem 5.7 define a 


i 
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plane, and such rings and planes will be investigated below. It is to include 
such rings and planes in the above transition from an affine to a projective 
plane that the apparently unnecessary step of changing the sign of the y-co- 
ordinate is made, since from the remaining axioms it does not follow that 
—xm=x(—m) although it is true that —xm=(—<x)m. 

Veblen and Wedderburn(*) have constructed certain number systems 
suitable for the coordinatization of planes very closely related to the rings 
characterized in Theorem 5.7. A more thorough investigation of these number 
systems and the planes which they define is given here. In §6 it will be shown 
that these systems correspond to a particular configuration theorem and 
further properties of these planes will be considered. 


THEOREM: 5.8. Given a ring R in which addition and multiplication satisfy: 
Al. (a+6)+c=a+(b+0), 
A2. b+a=a+5), 
A3. a+0=0+a=a, 
A4. a+(—a)=(—a)+a=0, 
(5.4) MS. 0a =a0=0, 
D1. (a+b)c=ac+hbe, 
E1. xa=6, a0, has a unique solution x, 
E2. az=b, a0, has a unique solution z, 
E3. xr=xs+t, r¥s, has a unique solution x. 
Then if 6#0 and y¥0 are fixed but arbitrary in R we may define points 
(x, y, 2) of forms (x, y, &), (b, y, 9), (0, 6, 0) where x and y take all values in R. 
Lines are defined as those sets of points satisfying one of the equations y)+xb+2c 
=0, x+2c=0, 2 =0, where b and c take all values in R. Points and lines so de- 
fined form a projective plane(') x. 
The plane x is independent of the particular choice of @ and wy. We may de- 
fine a new multiplication o by the rule 


(5.5) u=vosif and only if sy = on, uy = vr. 


The operations +, 0, define a ring R, satisfying all the properties which R does 
and determining the same plane. R, has the additional property that is a unit: 


(5.6) M3 ¢0a=a0¢=— <4. 


Proof. We must first show that there is one and only one line through two 
distinct points P and Q. We must treat cases separately according to the 


(*) O. Veblen and J. H. M. Wedderburn [1]. 

(?°) The systems used by O. Veblen and J. H. M. Wedderburn are slightly different. Here 
the right distributive law is used rather than the left, and the roles of the x and the y are re- 
versed. To go from their planes to these given here we consider the rings anti-isomorphic to 
theirs, take the dual plane, and then interchange the x and y. They do not state the analogue 
of the necessary condition E3, nor do they observe that the plane is independent of the choice 
of and y. 


4 
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different forms for P and Q. Every case goes through almost trivially except 
for P=(x1, 6), Q=(x2, ya, 6), and here with x1+x2, Neither P 
nor Q lies on af = 0 and we cannot have = 0 and xw+2c =0 with x2. 
Hence the line PQ, if there is one, is of the form ywW+xb+2c =0. Now observe 
from D1, putting b= —a, that 0=O0m=am-+-(—a)m whence —am =(—a)m. 
Hence if ywW+x1b+¢c=0 and ywW+x2.b+¢c=0, by subtraction and D1 
(91 + (x1 —%2)b =0 whence From E1 this de- 
termines a unique 6. Then from E2, ¢c= —yy—xb determines c uniquely. 
From this determination of 6 and c we find a unique line containing P and Q. 
To show that two different lines intersect in one and only one point we must 
again treat several cases according to the form of the lines. These all go 
through quite easily except when they are of the forms y/+xb,+2c,=0, 
and here only when Since it cannot be a point 
(0, @, 0) or (¢, y, 0) with b:+b2. Hence it must be a point (x, y, @). Substitut- 
ing z=@ we have xb; =xb2+$¢2— aq and by E3, since b;~be, there is a unique 
solution x. Then yW = —xb,—2c; by E1 determines y uniquely and we find a 
unique point (x, y, @) on both lines. Hence these points and lines form a pro- 
jective plane z. 

To show that 7 is independent of the particular choice of ¢ and y, we 
show first that for @ fixed it is independent of y, and then that for y fixed it 
is independent of ¢. ; 

Let m7 be the plane determined by ¢ and y,, and 7; be the plane determined 
by ¢ and yx. The relation m/,=u'f. determines by E1 a 1-1 mapping uzun’ 
of R into itself leaving 0 fixed. Also let ¢c = —ay, dd = —aye determine the 
mapping c=2d. The correspondences 


(x, = (x, 9’, 
(¢, y, 0) = (9, 9’, 0), 
(0, , 0) = (0, ¢, 0), 
Wi + xb + = OF + + 2c = 0,7 
2d = 0, 
ay, = 2, = 0, 
set up an isomorphism between 7; and m2. Hence 7 is independent of the 


choice of y. 
Now let m be the plane determined by ¢; and ¥ and 7 be determined 


by ¢ and y. By E2, ¢1=q@2.u’ determines a 1-1 mapping of R onto itself 
leaving 0 fixed. Also let y= be determined by yW = —¢1b, —¢2b = SY. Then 


the correspondences 


(5.8) (¢1, 0) ($2, 4, 0), 
(0, $1, 0) (0, $2, 0), 


* 
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Wt = 0, 
(5.8) a+ 2c’ = 0, 
=0, 


are an isomorphism between 7 and 7:2. 

Now let us form a new ring R; whose elements are the same as those of R. 
In R; addition is the same as in R, but multiplication is a new operation (0) 
as defined by (5.5), using arbitrary but fixed values for @ and y. In Ri 
Al, 2, 3, 4 are clearly satisfied since the operation of addition is the same as 
in R. MS for R; comes immediately from (5.5). Leta om=w,bo m=z, whence, 
by (5.5), mlb =¢c, ay =we, b) =2c. Adding the last two of these and using D1 
for R we have (a+5)y = (w-+2z)c, whence, by (5.5), (a+b) om=w+z=a0om 
+5 o m, proving D1 for R:. Now x o a=} with a0 is equivalent to ay = ¢c 
and bj =xc, where the first determines c¥0 uniquely and the second deter- 
mines x uniquely, proving E1 for R;. Similarly E2 for R; follows immediately. 
Ifxor=xos+t, we have withxor=z,xos=w, 


zs=wti, ry = om, sy = gn, 


(5.9) 
zy =-xm, wy = xn, sy = wy + Wy, xm = xn + by. 


Since rs, m is different from m, and so by E3 for R the last equation deter- 
mines x uniquely. This proves property E3 for R:. This shows that R; satis- 
fies all the properties which R does. Also in R, if ao ¢=b, then py=¢gm, 
by =am, whence m=y and b=a. If ¢ then ay=gm, b) = gm, whence 
ay = by and a=5b. Hence in R; the rule M3, oa=a 0 =a , is satisfied. 

It remains to show that R; defines the same plane x which R defines. For 
this purpose we shall take ¢ and y from R to determine the plane and @ 
and ¢ from R;. Remembering that in R; we may write ¢ as 1, we see that the 
isomorphism is 

(x, y, = (x, y, 1), 
(, y, 0) = (1, y, 0), 
(0, d, 0) = (0, 1, 0), 
=0, 
wy =022=0, 


where c’y = ¢c, b’f = $b. Note that the isomorphism does not alter the coordi- 
nates of points. If we use the correspondence s=s’, where s’f =¢s, then from 
(5.5) we have immediately (vo s’)y=vs. Multiplying the line equations for R; 
on the right by ¥ and using this rule, we see immediately that they are equiva- 
lent to the line equations from R just as the correspondence is given in (5.10). 


255 
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This process is of course reversible in going from line equations from R to 
those from R;. This establishes the isomorphism, and completes the proof. 


THEOREM 5.9. If a plane w is determined by a ring R satisfying axioms 
Al, 2, 3, 4, M3, 5, D1, and E1, 2, 3 as in Theorem 5.8 then R is the ternary ring 
determined by x as in Theorem 5.4, where x-mob=xm-+b, taking A, B, (0, 0, 1) 
and (0, —1, 1) as the fundamental quadrilateral. The correspondence between the 


two representations is given by 


(0, 0) = (0, 0, 1), 
(1, 1) 2 (1, I, 1), 


x=cext+(—o 


y= am+beyt+ 


The mapping (x, y)—>(x+s, y+?) for arbitrary s, tCR is a collineation of x 
leaving the points of L., fixed. If a plane m is determined by such a ring R 
and m2 is any plane isomorphic to with then in the ring 
of m2, x-m 0ob=xom-+b where the addition (+-) and multiplication (0) satisfy 
axioms A1, 2, 3, 4, M3, 5, D1, and E1, 2, 3. Moreover Rz may be considered as 
containing the same elements as R, where addition is the same and multiplication 


(6) may be determined from that of R by the rule 
(5.12) w= xod if and only if ds = at, ws = xt, 
for appropriate fixed a~0, s#0, and all x and d. 


Proof. Given a ring R with unit determining a plane as in Theorem 5.8. 
The correspondences (5.11) determine points (x, y) and lines x =c, y=xm+b 
satisfying the postulates for a Euclidean plane isomorphic to the finite part 
of the projective plane given by the pseudo-homogeneous coordinatization 
from R. Let us now introduce coordinates in this plane according to the rules 
of Theorem 5.4 with A, B, (0, 0) and (1, 1) as the fundamental quadrilateral. 
A line y=xm goes through (0, 0) and (1, m) and hence is the line y=x-m 0 0. 
The line y=xm+b with 50 is parallel to y=xm and goes through (0, 3) 
and hence is y=x-m o b. Here lines x =c are clearly the same in both repre- 
sentations. Hence x-m o b=xm-+5 and R is the ring determined by 7 as in 
Theorem 5.4. 

The mapping (x, y)—(x+s, y+é) takes the points of the line x =c into 
those of x =c+s and the points of y=xm-+5 into those of y=xm+(sm+5—1?) 
whence it is a collineation leaving the slopes of lines unchanged and hence 
leaving the points of L,, fixed. As s and ¢ are arbitrary these collineations are 
simply transitive on the finite points of 7. 

Given a plane 7 determined by a ring Ri, let 72 be a plane isomorphic 
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to 7; with A,=A>», B,=B;. Since there are collineations of 7; transitive on its 
finite points and leaving A and B fixed there will be such an isomorphism 
which also maps (0, 0) cf 7 onto (0, 0) of m2. Hence for an appropriate s, 
y =xs of 7 is mapped onto y=<x of m2, and we have (x, xs)—>(u, u). Now re- 
place the symbols used in 72 so that this mapping becomes (x, xs)—>(x, x). 
Hence in general, mapping 7 onto 7, 


(5.13) (x, ys) — (x, y). 


Now consider the natural ring R: of m2 (the ternary ring of Theorem 5.4) 
determined by the fundamental quadrilateral A, B, O=(0, 0) and I=(a, a). 
Note that the symbols used are taken from 7 and that the unit 1 of 7; need 
not be the unit a of m2, although their respective origins coincide. A line 
y =x-a0C is the image of a line y=xs+# where (0, ¢)—>(0, c) whence t=cs and 
a point (x, y) on it is the image of (x, ys). Hence y=x-+c in R; if and only if 
ys =xs+t=xs+cs =(x+c)s in Ri, whence y=x-+<c in R; and so addition in R; 
is the same operation as addition in R;. In Re, y=x-d o 0 goes through (0, 0) 
and (a, d) and so is the image of the R, line through (0, 0) and (a, ds), which 
is y=xt where ds =at determines ¢. Hence if w=x o d in R, then (x, w) in m2 
is the image of (x, ws) in 71 where ws =xt. Hence ds =at, ws =xt in R; define 
w=xo0d in R:, and this is the rule (5.12). In mz the line y=x-m 0 b goes 
through (0, d) and is parallel to y=x om which goes through (0, 0) and 
(a, m). These are the images of (0, 0) and (a, ms) in m. Hence it is the image 
of y=xt where at=ms. Also (0, bs)—>(0, 6) and so y=xt+bs—-y=x-m 0 b. 
Hence by (5.12) y=xt+bs-y =x o m+5, and so in Re, x-mob=x0m-+b. 
We may now verify, as in Theorem 5.7, that R; has properties A1, 2, 3, 4, 
M3, 5, E1, 2, 3 where a is the unit of Re. 


THEOREM 5.10(!"). Rings Ri and Re with properties A, M, D, E of Theorem 
5.9 determine planes and m2 isomorphic with if and only if 
each can be obtained from the other by rule (5.12). If in R, multiplication is 
associative then Rz is isomorphic to R; and Ri@Rz by the rule (5.14), ua "=u, 
ua~'+va—"u +2, (ua-") (va) =u 


Proof. From Theorem 5.9, R; and R2 are each determined by their planes 7 
and 72 and the choice of the fundamental quadrilateral A, B, O, I. If A:mAz, 
B,=B:, R: may be obtained according to Theorem 5.9 from R: with the same 
addition and with multiplication given by the rule (5.12). Similarly R; may 
be obtained from R>. If in R; multiplication is associative, from ds = at, ws = xt 
we have ws =xa~'ds, whence x d=xa~'d. Using this we 
easily verify the correspondences (5.14). Note that if multiplication is associa- 
tive, then the substitutions x-xm-+), m0, form a doubly transitive group 
as in Theorem 5.6. 


(2) If R: does not have an associative multiplication R; need not be algebraically isomor- 
phic to R;. See appendix II for examples. 
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We shall summarize the results of this section briefly. For every choice 
of a pair of points A, B in a plane z a set S of doubly transitive permutations 
is determined (Theorem 5.2) and if L, M are two of these, ML-" is associated 
with a projection in 7. Conversely a doubly transitive set S of permutations 
satisfying the necessary conditions determines a plane + (Theorem 5.2). 
Every choice of a quadrilateral A, B, O, I in determines a ternary ring R 
(Theorem 5.4) with a zero and a unit. If 7 is an analytic plane over a skew- 
field F then R= F with x-m o b=xm-+6 for every choice of A, B, O, I (equa- 
tions 5.2, Theorem 5.9). A plane represented by a Veblen-Wedderburn num- 
ber system is independent of the choice of ¢ and y, and may also be repre- 
sented by a Veblen-Wedderburn number system with a unit (Theorem 5.8), 
and this system is one of the rings R determined by 7 as in Theorem 5.4 
(Theorem 5.9). Multiplication in this system is associative if and only if the 
A —B permutations S form a group (Theorems 5.6, 5.7, 5.10) or a coset of a 
group (Theorem 5.5). 

6. Configuration theorems. A configuration theorem is a statement of the 
following sort: Given points P;, ---, P,, and lines Zi, - --, Z, and the in- 
formation that (1) at least certain points (P) are distinct; (2) at least certain 
lines (L) are distinct; (3) P;CL; for certain points and lines; and (4) P,CL, 
for certain points and lines, then we may conclude that there are further in- 
cidences P,CLy. 

Axiom A3 of Veblen and Young is of this type as are also the Theorems 
of Desargues and Pappus. It is natural to enquire what theorems of this sort 
are possible and what relationships exist between such theorems. Outstanding 
in this connection is Hessenberg’s proof that the Theorem of Desargues is a 
consequence of the Theorem of Pappus(!*). Moufang(") has investigated a 
number of theorems which are consequences of the Desargues Theorem and 
in particular has shown that the Theorem of the complete quadrilateral is 
definitely weaker than the Theorem of Desargues. For the most part Moufang 
assumes not only Hilbert’s axioms of connection but also the axioms of order. 
Most of her results are, however, true without the hypothesis of order. 

In a configuration theorem, we are given a configuration C and the con- 
clusion leads to a configuration C* containing the points and lines of C and 
further incidences. In the statement of a configuration theorem above, C* was 
assumed to have exactly the same points and lines as C. It is an unimportant 
generalization to permit C* to contain points and lines constructed from C. 
If the given configuration is not open then Moufang calls the theorem con- 
ditional. If C is open and C* is obtained by adding a single incidence to C, 


(*) G. Hessenberg [1]. 

(8) R. Moufang [1-7]. She has shown in [6] that the validity of the theorem of the com- 
plete quadrilateral is equivalent to the possibility of introducing coordinates from an alternative 
field, and that there exist alternative fields (that is, the Cayley algebra) which are not skew- 
fields. Alternative fields are covered in M. Zorn [1, 2] and in R. Moufang [7]. 
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we shall call the theorem simple. Most theorems may be treated as a com- 
bination of several simple theorems. C is free equivalent by Theorem 4.9 to 
some r-point (4.3) and if C* is also open it is free equivalent to some (r—1)- 
point, since in Theorem 4.11 P and L are unchanged and J has been increased 
by 1. This means that any finite configuration in the plane can be generated 
by an (r—1)-point and in a restricted way. All such theorems which the au- 
thor has investigated are demonstrably impossible. We shall restrict our at- 
tention to theorems where C is open and C* is confined. For if C* is not open 
then it contains a confined configuration which may be obtained by free 
contraction from C* and this may be considered the consequence of the ap- 
propriate incidences in part of C. 

As a further refinement in the classification of configuration theorems we 
distinguish between theorems in which the points and lines of C are aibitrary, 
apart from the mutual relationships required of them, and theorems in which 
certain points and lines of C are fixed. The first type we shall call a universal 
theorem, the second a fixed point theorem. Hilbert’s formulation of the 
Desargues Theorem is a fixed point theorem in that he requires the axis of 
perspectivity always to be the line at infinity. Theorems on webs(") are of 
course fixed point theorems. 

The Theorem of Pappus in a plane 7 is equivalent to the possibility of 
introducing coordinates from a commutative field F. What further configura- 
tion theorems can there be in such a plane 7? Given C we may take four of its 
generating points as a fundamental quadrilateral and introduce / as the natu- 
ral ring of determined by this quadrilateral. The other points and lines of C 
may be represented by values from F, including parameters representing the 
arbitrariness of the choice of the generators of C. The conditions (1), (2), (4) 
will correspond to a relation A(u, - - - , w)#0 where A is an appropriate ra- 
tional function of the parameters u, - - - , w. After we express the coordinates 
of the points and the equations of the lines in terms of the parameters the 
conclusion PCL will determine a relation B(u, - - - , w) =0. Hence the theo- 
rem may be stated thus: B=0 whenever A ~0, whence AB=0 for all values 
for which AB is defined. If AB=M/N where M and N are polynomials, then 
MN=0 for all values of the parameters. Here MN is a polynomial, simplified 
only according to the general rules of fields and not according to special prop- 
erties of F, such as finite characteristic or particular relations between fixed 
elements of F. If MN is formally zero, then the theorem is a consequence of 
the field properties of F and hence of the Theorem of Pappus. If 4N is not 
formally zero and involves no parameters, then the theorem either implies 
that F has finite characteristic or expresses a relation on fixed elements of F. 


(“) W. Blaschke and G. Bol [1] is an excellent work on the subject of webs and includes an 
extensive bibliography. A long series of papers on Topologische Fragen der Differentialgeomctrie, 
beginning in 1927, mostly appearing in the Abh. Math. Sem. Hamburgischen Univ., are largely 
concerned with web configurations. 


{ 
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In the latter case the entire configuration is fixed. If MN contains parameters 
x1, x, then MN=ao(m, +, + +, In this 
case F must be a finite field. For either ao(x1, - - - , x,-1) =O forall x1, 
of F or there is a polynomial agx{+ - - - +a, which is not identically zero, 
which vanishes for all x,C F. In either event we are led eventually toa non- 
zero polynomial in one argument x which vanishes for all choices of x from F. 
But such a polynomial has only a finite number of roots and hence F must 
be a finite field. In the finite field with = p* elements every element satisfies 
x"—x =0. Our conclusion will be stated as a theorem: 


THEOREM 6.1. If the Theorem of Pappus is valid in a plane 1, then any con- 
figuration theorem which does not consist entirely of fixed points is either (1) a 
consequence of the Theorem of Pappus, (2) equivalent to characteristic p for F, 
or (3) equivalent to finiteness for F and hence for r. 


In a plane z if the Desargues Theorem is valid but not the Pappus Theo- 
rem, then corresponding to any configuration theorem in 7 not a consequence 
of the Desargues Theorem, there is an identity in the coordinatizing skew-field 
K. A universal theorem in 7 will correspond to an identity expressible en- 
tirely in terms of the unit 1 and parameters. Here there is no canonical form 
for rational functions of more than one indeterminate and little is known 
about the possibility of universal identities("). If there is a one-parameter 
universal identity in K, it can, as above, be shown to be equivalent to a 
polynomial identity with coefficients from the characteristic subfield. Hence, 
as 0, 1, - - + must be roots, the characteristic must be a finite p. If this iden- 
tity is F(x) =0 where F(x) is a polynomial which does not vanish identically, 
then we may suppose without loss of generality that F(x) has no multiple 
factors. In this case if m is the least common multiple of the degrees of the 
divisors of F(x) then, from well known theorems on polynomial decomposition 
modulo p, F(x) is a divisor of x” —x, where N=p*. Hence x” =x for every x 
in K, and x*-'=1 for every x~0 in K. For x0, x-!=x*-* and so the sub- 
field generated by elements x, y, - - - , zis a polynomial ring in these elements. 
Consider the subfield Q(x, y). Every element in Q is a linear combination 
with coefficients 0, 1, - - - , #—1 of elements from the group G(x, y) generated 
by x and y. Every element in G has order dividing N—1. If Burnside’s con- 
jecture is correct G must be finite("*). If this conjecture is true, then Q must 
be finite, and by the Wedderburn Theorem Q is a Galois field and yx =xy. 
Hence K is commutative, whence K itself is a finite field containing at most 


(4) In D. E. Littlewood [1] and A. R. Richardson [1] polynomial relations with coeffi- 
cients from K are considered. W. Wagner [1] considers universal polynomial relations and 
shows that none exist in ordered planes, but that there always exist universal identities in 
division algebras, since the finite basis permits a matric representation and matric identities 
exist. B. H. Neumann [1] and others have treated the analogous problem for groups. 

(*) This conjecture is discussed in B. H. Neumann [1]. Analogous ring and field problems 
are discussed in A. Kurosch [1]. 
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p* elements. But even if this conjecture on groups is false it may very well 
be true that x” =x, N=p* for all elements of a skew-field K, implies that 
K is finite (and hence is a Galois field with at most »* elements). For »=1, 
F(x) =x(x—1) - ++ (x—p+1) and K is the characteristic field K(p). For 
n=2 every element of K not in K(p) satisfies an equation of second degree 
over K(p). Hence x? =ax-+5, y?=cy+d, and x? and y? are expressible as func- 
tions of first degree. Also (x+y)*=g(x+y) +A and x*+xy+yx+y?= 1st de- 
gree, whence yx = —xy+ 1st degree. Here every third degree expression in x 
and y is also expressible as one of at most second degree. For example 
xyx=x(—xy+i1st degree) = —x*y+2nd degree=(ist degree)y+2nd degree 
=2nd degree. Hence every expression of third degree or higher can be re- 
duced to second degree, and 1, x, y, xy are a basis of Q(x, y) whence Q is finite 
and K is finite and contains at most p? elements. 

For n=3 let us suppose first that We have x’, (x+~)*, (x—y)* 
all of second degree. Combining these we get 


xty + xyx + yx? + xy? + yxy + y?x = 2nd degree, 
— x*y — xyx — yx? + xy? + yxy + y?x = 2nd degree, 
whence since p#¥2 
x*y + xyx + yx? = 2nd degree, 
xy? + yxy + y?x = 2nd degree. 


Using these we need retain for a basis of Q only x*y, xyx, xy*, yxy of third de- 
gree, and x*yx, x*y?, xyxy, yxyx of fourth degree. Using (xy)*, (yx)*, (xy+<x), 
(yx-+-y)* as of fourth degree, we find that xyxyx and yxyxy are of fourth degree. 
Hence of fifth degree terms we need retain only x*yxy and xyxy?, and of sixth 
only x*yxy?, and no terms at all of seventh or higher degree. Hence Q is finite 
and K is finite with at most p* elements. A similar but slightly longer argu- 
ment holds for p? = 2'=8. On the basis of these computations and other con- 
siderations the author makes the conjecture that in a Desarguesian plane 
there can be a universal five point theorem which is not a consequence of the 
Desargues theorem only if the plane is finite. 

A universal theorem on six or more points corresponds to a universal 
identity with two or more parameters. As noted above no such theorems exist 
in an ordered plane, but theorems.certainly exist in planes coordinatized by 
division algebras (skew-fields with a finite basis over their center). In any 
quaternion algebra(!’) if x, y, are three arbitrary elements, then 2(xy—y-x)? 
=(xy—yx)*z, or in words, the square of any commutator is in the center. 
This identity corresponds to a seven point universal theorem in Desarguesian 
planes. To find the fullest algebraic consequences of this geometric theorem, 
we prove the following theorem. 


(?7) For the general quaternion algebra, see A. A. Albert [1, p. 145]. 
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THEOREM 6.2. Given a skew-field K whose center is F. If 2(xy—yx)? 
=(xy—yx)*z for three arbitrary elements of K, then either K=F or K isa 
quaternion algebra over F. 


Proof. Here the square of every commutator xy — yx is in F. 


LEMMA 1. An arbitrary element x of K not in F satisfies an equation 
xt=rx+s with r, sCF. 


The proof is relatively simple if the characteristic of K is not 2. Here 
let y be an element which does not permute with x and write u=yx—xy, 
uw=aCF, v=yx?—x*y, v?=abCF. Also v=ux+xu and vu=uv=ax 
+uxu. Now u+v=y(x?+x)—(x?+x)y is a commutator, whence (w+)? 
=dCF and so u?+2uv+v?=d, and since the characteristic is not 2, 
uv=cCF. Also ux=x(yx)—(yx)x is a commutator and so uxux=aeC F. 
Hence c=ax+uxu, cx =ax*+ae, and as a~0 we may divide by a and ob- 
tain x?=rx+s, 7r,5sCF. 

When K is of characteristic 2 the above proof fails and the proof is more 
difficult. As before, choose a y which does not permute with x and write 
yx-+xy =u, yx?+x*y =0, v? =ab, v= ux+xu. Again uv =vu =ax+uxu 
and uxux =ae~0 since ux is a commutator. Here uvx = ax?+ae. Clearly x per- 
mutes with the right and hence also with the left of this equation. Since u 
and v permute we may square and obtain u*v*x?=a*x*-+-a%e? whence a*bx? 
=a’x*+a%e? and so x‘+bx?+e?=0 is an equation of fourth degree over F 
which x satisfies. If f(x) =x*+bx?+e? has a linear factor, then } is a square 
and f(x) is the square of a polynomial of second degree. Hence if f(x) is re- 
ducible over F then x satisfies an equation of second degree over F. In particu- 
lar if the choice of a different y gives different values for b and e, then x satis- 
fies an equaticn of second degree over F and the lemma is proved. Hence sup- 
pose that 6 and e depend solely upon x and are independent of y. Now 
uxux = ae = a(x* + bx*)/e whence uxu = a(x* + bx)/e, uxa = a(x* + bx)u/e, 
ux = (x*+bx)u/e. From this we easily calculate yx=xy+u, yx? =xyx-+ux 
=xty+ [xt+(b+e)x]u/e, yx*=x*y+(b+e)u. Consequently d=x?+(b+e)x 
permutes with y. Since d clearly permutes with every element which permutes 
with x, and also with an arbitrary element y which does not permute with x, 
it follows that d is in the center F. As x satisfies an equation of degree lower 
than the fourth, from earlier remarks x must satisfy an equation of second 
degree over F. This completes the proof of the lemma. 


LEMMA 2. If yz=zy, then either (1) 1, y, 2 are linearly dependent over F 
or (2) F has charactristic 2 and y*=a, 2*=b, with a, bCF. 


Suppose that 1, y, 2, yz are linearly dependent over F. Then either 1, y, z 
are linearly dependent over F or yz=cy+dz+e. Here (y—d)z=cy+e, 
2=(y—d)—(cy+e) =my-+n and in either case 1, y, are dependent over F. 
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Hence if 1, y, 2, yz are dependent, so are 1, y, 2. 

Let y?=my+a, a0, 22=nz+b, (yz)*=eyz+f, Then 
(my+a)(n.+b) =eyz+f and (mn—e)yz+anz+bmy+ab—f=0. Here either 
1, y, 2, yz are dependent and the lemma follows or m=n=e=0 and ab=f. 
Now let (y+z)?=p(y+2)+g. Here a+2yz+b=py+p2+4, and if the char- 
acteristic is not 2, 1, y, 2, yz are dependent and the lemma follows. If the 
characteristic is 2, then y?=a, 2?=b, (y+z)*=a+0 and no further conclusion 
may be drawn. This is the second alternative in the lemma. 

The remainder of the proof is quite straightforward and will be merely 
sketched. It is easy to show that if x and y are two elements of K which do 
not permute then 1, x, y, xy are a basis over F for elements generated by x 
and y. As in A. A. Albert [1, p. 145] we may suppose x and y so chosen that 
x?=x+ 8, xy=y(1—x), y?=y, 48+1 +0. Let z be an arbitrary element of K. 
We wish to show that z belongs to the quaternion algebra Q(1, x, y, xy). 
Replacing z by z+ mx, if necessary, we have 2?=c. If 2 permutes with x, by 
Lemma 2 z is dependent on 1 and x and so zCQ. Suppose then that zx ¥xz. 
From the equation (x+2z)*=p(x+z)+q we find that (p—1)(xz—zx) =0 
whence p= 1 and so we find that zx -+-xz =z+1r,rC F. Now putting 2’ =z+dx+e 
we can determine d and ¢ so that 2’*=c’, 2’x-++-xz' =2’. Now it is easy to verify 
that (yz’)x =x(yz’) whence, by Lemma 2, yz’ =s+iéx, since with x?=x+ 8 the 
second alternative cannot arise. Hence 2’ = y~'(s+-éx) CQ and also zCQ. Con- 
sequently K=Q and we have proved the theorem. Geometrically we have 
observed that in those planes coordinatized by quaternion algebras there is a 
seven point configuration theorem, not a consequence of the Desargues The- 
orem, which corresponds to the universal identity 2(xy—yx)? = (xy —yx)*s. 
Conversely we have proved that the assumption of the Desargues Theorem 
and this additional seven point theorem leads either to the Theorem of Pap- 
pus (when K is commutative) or to one of the planes coordinatized by a 
quaternion algebra. 

Although little is known about configuration theorems stronger than the 
Desargues Theorem, we are somewhat better informed about theorems 
weaker than the Desargues Theorem. The series of papers by Moufang is 
concerned with the theorem of the complete quadrilateral, and some special 
cases of this theorem and the theorems on webs, in particular the Reide- 
meister figure and the Thomsen triangle as given in the series of papers 
Topologische Fragen der Differentialgeometrie, are all consequences of the 
Theorem of Desargues. In fact the author knows of no valid universal theo- 
rem which is not at least consistent with the Theorem of Desargues. Theorems 
exist which are neither stronger than the Theorem of Desargues, nor conse- 
quences of it. Theorems of this sort known to the author are those which 
together with the Desargues Theorem would imply a finite characteristic for 
the coordinatizing skew-field. 

A configuration of considerable interest is given in Figure 4. It is the spe- 
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cial case of the Desargues configuration in which the center of perspectivity 
lies on the axis of perspectivity. The associated configuration theorem is 


given below. 


(M) (NV) B 


x 


THEOREM L. Given the lines AMNB, ARX, ASY, ATZ, RSM, RTN, 
XYM, XZN, BYZ, then also BST is a line. 


From the same configuration several other equivalent configuration theo- 
rems may be formulated. This is clearly a 4-web theorem as it stands, involv- 
ing the pencils through A, B, M=(m), and N=(m). Consider the ternary 
ring with A, B, O, I as the fundamental quadrilateral, where OJ intersects 
AB at N=(m)=(1). The points X, R on AX and Y on XM are arbitrary 
after A, B, M, and N are fixed. Let us take X = (0, 0), R=(0, 6), Y=(a, am). 
Then S=(a, a-mob), Z=(am, am), and T=(am, am+6). The conclusion 
that BST is a line leads to 


(6.1) a-mob=am-+b. 


Hence Theorem L with A, B, M, N, X as fixed points is precisely equivalent 
to the rule (6.1) which expresses the ternary relation as a combination of two 
binary relations. Or, otherwise expressed, it is the rule that a line be associated 
with a linear equation. 

Now let us keep A, B, M, N fixed and let X be arbitrary. Write X =(a, bd), 
R=(a, c) where a, b, ¢ are arbitrary. The line ASY also may be taken arbi- 
trarily as x =k. The equations of the lines, using (6.1), will be y=x+d for XN 
with b=a+d, y=xm-+e for XM with b=am+e, y=x+f for RN with 
c=a+f, and y=xm+g for RM with c=am-+g. The points now become 
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Z=(h, h+d), Y=(k, h+d) where h+d=km-+e determines h, T=(h, h+f), 
S=(k, km+g). The application of the theorem now yields km+g=h+-f. The 
relevant equations are 


b=a+d, b=am+e, c=a+f, c=am+g, 
(6.2) h+d=km+e, 


where a, b, c, k are arbitrary, the first five equations determine d, e, f, g, h 
and the final equation is a consequence of the theorem. Instead of taking 
a, b, c, k as the arbitrary elements we might equally well take a, e, g, h as the 
arbitrary elements and consider the system of equations 


a+d=am-+e, at+f=am-+g, h+d=km-+e, 
h+f=km+g, 


where the last equation is a consequence of the first three. Here taking e=0, 
h=0 we have a+d=am, a+f=am+g, d=km, and f=km+g=d+g whence 


(6.4) a+(d+g)=(a+d)+g 


for arbitrary a and g, and d determined by a+d=am. This is a restricted as- 
sociative law of addition. We may use this to prove the general associative 
law for addition. In (6.3) we may take d, e, g, h as arbitrary since, by (6.1) 
and property (5) of Theorem 5.4, if m#1, a+d=am-+e determines a uniquely 
from d, e and m. Here take e=0. Then a+d=am, a+f=am+g=(a+d)+g 
whence by (6.4) f=d+g. Here h+d=km and by substitution the final equa- 
tion becomes 


(6.5) (h+d)+g=h+(d+g) 


where h, d, g are all arbitrary. Since by Theorem 5.4 there is a zero for 
addition and in an equation r+s=# any two of r, s, ¢ determine the third 
uniquely, the associative law (6.5) is all that is needed to prove that addition 
forms a group. It is readily verified that if addition forms a group, then the 
last of equations (6.2) is a consequence of the others. Hence Theorem L for 
A, B, M, N=(1)=C fixed is equivalent to the linearity (6.1) of the ternary 
relation and the fact that addition is a group. Note that this does not require 
the addition to be an abelian group. Note in Figure 4 that permutations of 
M, N, B do not alter the configuration but that A plays a special role. 

Let us apply Theorem L again with A as the center of the pencil, 
y=constant; B the center of the pencil, x =constant, M=(m), N=(1)=C. 
If we take X =(0, 0), R=(a, 0), and Z=(0, b) with a, b arbitrary we find 
that XN is y=x, XM is y=xm, RN is y=x—a, RM is y=xm—am. Hence 
Y=(b, bm), T=(c, 6) where b=c—a, whence c=b+a, T=(b+a, 6). Here 
S=(d, bm) where bm=dm—am, whence bm+am=dm. By application of 


(6.3) 


km+g=h+f, | 

| | 
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Theorem L, BST are collinear whence d=)+<a and we have 
(6.6) bm + am = (b + a)m 


the right distributive law. 

Let us apply Theorem L again with M as x=constant, N as y=constant, 
A=(1), B=(m). We may take X =(0, 0), R=(a, a) and ASY as y=x+b 
where a and 6 are arbitrary. Then S=(a, a+6), Y=(0, 6), whence BYZ is 
the line y=xm+b and Z=(c, 0) where 0=cm+0d. Then ATZ is the line 
y=x—c and since RTN is y=a, T=(a+c, a). By Theorem L, BST are col- 
linear. Let its equation be y =xm-+d where d is determined by a+) =am-+d. 
The conclusion is a= (a+c)m+d. Hence if 0=cm+b and a+b=am-+d then 
a=(a+c)m+d. Instead of considering a and 6 as the arbitrary elements we 
may take 6 and d as the arbitrary elements. Here by (6.6) and the fact that 
addition forms a group we have cm=—b, —am+a=d—)b, a=am+cm-+d, 
—am+a=-—b+d, whence d—b= —b+d and so 


(6.7) b+d=d+b 


for arbitrary 6 and d, the commutative law of addition. Hence the natural ring 
R of the plane 7 satisfies all the conditions of Theorem 5.8 and by Theorem 
5.9 this ring is a Veblen-Wedderburn number system with a unit. 


THEOREM 6.3. If configuration Theorem L is satisfied in a plane x with Le 
fixed for all choices of A, B, M, N on L,, with three of these as the center of the 


pencil x = constant, the center of the pencil y = constant, and the point (1), then R, 
the natural ring determined by a fundamental quadrilateral A (center of x =con- 
stant), B (center of y=constant), O(0, 0), and I(1, 1), ts a Veblen-Wedderburn 
number system with unit. Conversely if is a plane determined by a Veblen- 
Wedderburn system, then configuration Theorem L is valid for all choices of 
A, B, M, Non Le. 


Proof. In establishing rules (6.1) to (6.7) we have used Theorem L only 
with A, B, M, N as including the three points A, B, C=(1) determined by a 
fundamental quadrilateral A, B, O, J. This proves the first part of the theo- 
rem. 

The converse part of the theorem is stronger than the first part, in that 
the points A, B, M, N of the configuration, apart from lying on L.., need bear 
no special relation to the coordinatizing fundamental quadrilateral A, B, O, J. 
As noted in Theorem 5.9 the translations (x, y)—>(x+s, y+é) are collineations 
of r leaving the points of L,, fixed. Hence without loss of generality we may 
take X=(0, 0) in verifying the validity of Theorem L. Suppose A =(p), 
B=(q), M=(m), N=(m) is a choice of four distinct points on L,. Here 
R=(a, ap) with a arbitrary and Y=(6, bm) with bd arbitrary. Line equations 
are as follows: XRA: y=xp, X YM: y=xm, XZN: y=xn, RSM: y=xm—am 
+ap, RTN: y=xn—an+ap, BYZ: y=xq—bq+bm and ASY: y=xp—bp 
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+bm. Here if S=(c, d) and Z=(e, f) we have 
(6.8) d=cm—am+ap, d=cp—bp+bm, f=en, f = eq — bg + bm. 


From the first two of these we have (c—a—b)m=(c—a—b)p whence, since 
m#p, c—-a—b=0 and c=a-+-b, whence d=bm+ap, S=(a+b, bm+ap). 
Here the equation of ATZ is y=xp—ep+en. Hence if T is (g, h) we have 
h=gn—an+ap,h=gp—ep+enand, solving as forcandd, g=a+e,h=ap-+en, 
T =(a+e, ap+en). It remains to show that BST are collinear. The equation 
of BS is y=xq—agq—bq+bm-+ap. Substituting the coordinates of T we must 
verify that ap++-en = (a+e)qg—aq—bq+bm-+ap. Here substituting for en from 
(6.8) we have ap+-eq—bg+bm =aq+eqy—ag—bq+bm-+ap, which is a true 
identity and so BST are collinear. 

Since the center of the pencil y=constant may be represented by (0), 
this verifies the validity of Theorem L for all choices of A, B, M, N on L. 
except those including the center of the pencil x=constant as one of the 
points. This requires two additional verifications, both similar to that given 
above but somewhat simpler. 

This is an affine theorem in that L,, is fixed but no points on L,, are fixed. 
Note that for every choice of fundamental quadrilateral A, B, O, I with A and 
B on L,, we get a Veblen- Wedderburn number system, but that these systems 
need not be algebraically isomorphic. For two different choices of the funda- 
mental quadrilateral it will be found (providing we alter the coordinates 
suitably) that addition is the same but that one multiplication may be found 
from the other by a rule on the order of (5.12), which is the special rule when 
A,=A:, B,= By. In appendix II are examples of non-isomorphic systems de- 
fining the same plane. The number of non-isomorphic systems is clearly con- 
nected with the nature of the collineation group of the plane. The collinea- 
tion group need not be doubly transitive on the points of L,., but in these 
examples and others not here given the group is simply transitive on the 
points of L... It would be interesting to investigate the collineation groups of 
these planes and the geometric equivalence of the Veblen-Wedderburn sys- 
tems more thoroughly. 

Now let us consider those planes in which Theorem L is a universal theo- 
rem, permitting the line ABMN to be arbitrary. Naturally the laws holding 
for the affine theorem will also hold for the universal theorem. Let us first 
take A=A, B=O, M=(0, a), N=(0, 6), X=B, R=(m) and let the line ASY 
be x =c. Here a+b, c¥0, and are arbitrary. Here the equa- 
tions of the lines are RSM: y=xm+a, RTN: y=xm+b, MYX: y=a, NZX: 
y=b. Here if Y=(c, a) and a=cr, BYZ is y=xr, whence Z=(d, 6) with 
b=dr. Then S as the intersection of ASY and RSM is S=(c, cm+a), while T 
as the intersection of ATZ and RTN is T=(d, dm+6). The theorem then 
states that BST are collinear and so if cm+-a=cs determines s, then dm+6 
=ds. Take c, d, r, m instead of a, b, c, m as the arbitrary elements, The 
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theorem becomes: if cm-+-cr=cs (c#¥0), then dm+dr=ds. Putting c=1 we 
have m+r=s, whence 


(6.9) dm + dr = d(m+r), 


the left distributive law. Clearly this law is a consequence of the left dis- 
tributive law and so is equivalent to it. The two distributive laws have as 
immediate consequences 


(6.10) — am = (— a)m = a(— m). 


Finally let us apply Theorem L with A=(0, 0), B=B, M=(a, 0), 
N=(b, 0), X=A, R=(0, c) and Y=(a, am) where a+b, c¥0, 
m0 are arbitrary. Here Z=(b, am). Line equations will be: ABMN: y=0. 
ARX:x=0, ASY: y=xm, ATZ: y=xn where am=bn, MSR: y=xp+c where 
O=ap+c, MXY: x=a, NRT: y=xq+c where 0=bq+c, NXZ: x=b, BYZ: 
y=am. We now find S=(d, dm) where dm=dp+c and T=(e, en) where 
en =eq+c. The theorem that BST are collinear yields en=dm. The relevant 
equations are 


am = bn, 0=ap+e, 0 = 


(6.11 
dm =dp+e, en =eq+c, en = dm. 


We may eliminate c from these equations and obtain 
(6.12) am=bn, ap=bq, ap=dp—dm, ap=eq—en, en = dm, 


where a, b, m, p are arbitrary and the last equation is a consequence of the 
first four. If we put a=1, p=1, m=1—b, we find that bg=1, db=1, n=q—1, 
e=1, and finally g=d, whence bg =gb=1 and we may write g=)-"', 


(6.13) bb-? = bb = 1, 


stating that the right and left inverses of an element are equal. (The ex- 
cluded possibility m =0 corresponds to )=1, where clearly b-'=1.) Now put 
a=1, p=1, m=1—br where and are arbitrary. This leads to g=d-"', 
n=b-!—r, d=(br)—', e=r—" and finally 


(6.14) (br)—? = 


Here the excluded possibility m =0 corresponds to br = 1 which is covered by 
(6.13). 

Again let us put m=a~' and choose p so that ap=bg=q—b-". We have 
immediately ap whence e=1. The last equation becomes 
b-'=da~ and combined with ap =g—b-'=dp—da™ leads to g=dp. The sys- 
tem of equations now reduces to 


(6.15) ap=q— bd, ap = bq, q = dp, b- = da“. 


. 
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Here b(ap) =b(g—b-') =bg—1=ap—1 whence (1—5b)(ap)=1, ap =(1—5)-* 
and ap—1 = (1—6)-!—1 = = = b) 
= (b-!—1)-'. Also and 
d(p — a) = (1 — 8), 
d[a-(ap — 1)] = (1-8), 
— 1)*] = (1 — 8), 
— 1)a] = (1 — 6)", 
— = 1 — 3B, 
(ba — = 1 — 
(b—a)d-! — = 1 — b. 
But ad-!=b by (6.14) and (6.15) whence (b-'a)d-'=1, and so b-!= (b-'a)a—". 
By (6.14) we have also b=[(b-'a)a—!]-!=a(a—"b). We write these two re- 
stricted associative laws in the form 
(6.16) a—(ab) = b = (ba)a“. 


It is quite easy to show that (6.14) is a consequence of (6.13) and (6.16). A 
Veblen-Wedderburn system satisfying also the left distributive law (6.9), the 
law of unique inverse (6.13), and the weak associative laws (6.16) is an alter- 
native field(*). The reader may find it an interesting exercise of ingenuity to 
show that from these laws the last of equations (6.12) is a consequence of the 
first four, but it is not necessary for our purposes to do so. 


THEOREM 6.4('*). If Theorem L is satisfied for all choices of A, B, M, N 
ow two lines in a plane 1, then Theorem L is a universal theorem in x and any 
natural ring associated with w satisfies 


a-mob = am + b, a(b +c) = ab+ ac, 

(a+b) +c=a+(b+0), (a + b)m = am + bm, 
a+b=6+4, al = la = a, 
a+0=0+¢a=4, ao =a-4=1, 
a+(—a)=(-—a)+a=0, a (ab) = b= 2 #0, 


(8) R. Moufang [6, 7], M. Zorn [1, 2]. 

(9) R. Moufang derives alternative fields from the theorem of the complete quadrilateral, 
most of the geometric part of her proof being in her third paper. Her Theorem 3a [Moufang 3, 
p. 762] is essentially the same as Theorem L in this paper. But her derivation assumes order 
and her method of proof is certainly not valid if the diagonal points of every quadrilateral are 
collinear. (This is essentially the geometric configuration corresponding to characteristic in the 
coordinatizing ring.) Curiously enough she does not seem to have observed that her Theorem 2, 
after renumbering vertices, is identical with her Theorem Dg (Theorem 1), but gives proofs of 


their equivalence. 
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and is an alternative field. The collineation group of © is transitive on the tri- 
angles of x. 


Proof. We have shown that if Theorem L is satisfied for all choices of 
A, B, M, Non L, and on x =0, then rules (6.17) are valid for the natural ring 
R determined by the quadrilateral A, B, O, I. In there exist the following 


collineations: 
(a) Translations: 


(x, y) (x +7, 9 +5), 
A-A, 
(m) — (m), 
y = am +b—y = xm+ (rm + b — 5). 


(8) The reflection: 
(x, y) = (y, *), 
(m) = (m-") 
x=cRy=e, 
y= am+bey = xm — bm" 


A-A, 

(m) — (m + 1), 
(x, y) (x, + 27), 
x=c—-x=C, 


y= am+b—y = x(m+r) + 


(6) The reflection: 


(m) = (0, m), 
(x, y) = (x 0), 
(a # 0), 
y= am+bey = xb +m. 


The validity of (a) has already been noted in Theorem 5.9. (8) is easily veri- 
fied by observing that if v=um+b, and 


(m # 0), 
(m # 0). 
(y) 
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u=vm~'—bm-"'. For (y) we observe that if y=xm-+5, then by the left dis- 
tributive law y+-xr =x(m-+r)+5. For (6) the verifications are trivial except 
to show that from v=um+), u<0, =u-"(um) +u- 

The collineation (6) takes L., into x=0. (a) takes x=0 into x=r, an 
arbitrary line through A different from L,,. (8) takes the pencil through A 
into the pencil through B. (y) takes the pencil through B into the pencil 
through an arbitrary (r). Hence the collineation group of 7 is transitive on 
the lines of z, and an arbitrary line may be taken into L... Hence since 
Theorem L is valid for all choices of A, B, M, N on L.., by Theorem 6.3, it 
is a universal theorem. Hence for an arbitrary choice of the fundamental 
quadrilateral, Theorem L will be satisfied for ABMWN on the new L,, and the 
new x =0 and so the natural ring wil! satisfy the rules (6.17). 

Consider any triangle PQS of x. By a collineation we may take the line 
PQ into L,.. By a combination of (7), which leaves A fixed and is transitive 
on the remaining points of L, and (8), which interchanges A and B, we may 
take two arbitrary points of L., into A and B. Then (a) leaves A and B 
fixed and takes an arbitrary point not on L,, into O(0, 0). Hence there is a 
collineation taking PQS into ABO and the collineations of m are transitive 
on the triangles of z. 

It is not known to the author whether or not the collineations of 7 are 
transitive on quadrilaterals. If they are, an arbitrary choice of the funda- 
mental quadrilateral will yield the same alternative field as the natural ring 
for x. If they are not, every natural ring will be an alternative field, but non- 
isomorphic alternative fields will determine the same plane. One may be ob- 
tained from another by the rule (5.12). It may be shown that the collineations 
of x will be transitive on quadrilaterals if and only if for every r~0 there 
exists a function ¢,(u) such that 


(6.18) or(a) + = +b), e(am) = ]. 


Obviously, when the multiplication is associative ¢,(u)=ru satisfies these 
conditions. 

Theorem L is a special case of the Theorem of Desargues. If we assume 
the Theorem of Desargues as a universal theorem, then not only the rules 
(6.17) must be valid for the natural rings associated with the plane, but, taking 
the origin as the center of perspectivity and L,, as the axis of perspectivity, 
it is easily shown that multiplication is associative, and that R is a skew-field. 
Then the homogeneous representation of 7 is easily established and the col- 
lineations (5.2) show that every Desarguesian plane is determined by a 
unique skew-field, and hence is an analytic plane. Conversely the verification 
of the validity of the Desargues Theorem in an analytic plane is a straight- 
forward and relatively simple calculation. But to prove that every Desargues- 
ian plane is an affine plane it is simpler to rely on the methods of standard 
texts than to use Theorem 6.3 which depends on the weaker Theorem L. 


| 
| 
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APPENDIX I 


THEOREM. Given permutations S on a finite number n of letters such that 
(a) S is doubly transitive, (b) if M, LCS and M#L, then ML— fixes at most 
one letter, then there are n—1 permutations taking a given letier i into a given 
letter j and (c) if L does not take i into j there is exactly one M taking i into j 
such that ML— displaces all letters. 


Denote the letters by 1, 2,---, ». Consider the permutations 7(i, 7) 
for fixed i,j, and ki. a, may take any value 1,---, 
except j7. Hence there are at least m—1 permutations T(z, 7) since S is doubly 
transitive. If two T(i, 7) permutations A, B have the same value for a,, 
then AB-' fixes both 7 and k and by condition (b) A =B. Hence there are 
exactly m—1 permutations T(z, 7). Suppose Z does not take 7 into j, 
The n—2 permutations with t#i, k and (j,) 
from ZL are different from LZ and each other, since LZ dces not take i 
into j, and if two of these were the same WN then NL would fix two letters. 
Call the remaining T(z, 7), M. Then M takes i into 7 and ML~ displaces all 
letters. 

If S is a set of permutations on infinitely many letters satisfying (a) and 
(b) then (c) may be violated in either of two ways. Either for a given L and 
(j) there may be no M taking i into j such that ML~' displaces all letters or 
there may be more than one. Both possibilities actually arise. Consider per- 
mutations on the non-negative integers 0, 1, 2, - - - . Suppose given a finite 
set Pe= where the a’s 
and 6’s are subject to the following conditions: (A) (270775), c¥e, d¥f, 
occurs in at most one P for any combination of c, d, e, f and every combina- 
tion with c+d+e+f3N does occur; and (B) For each P;, aa, --- are dis- 
tinct and include 1, 2,---, N and by,--- are distinct and include 
1, 2, ---, N. Then it is possible to extend P, - - - , P, for N=k to a larger 
system P,, - - - , P; which satisfies (A) and (B) for N=k+1. In a countable 
number of steps we have then constructed a set S of permutations satisfying 
(a) and (b). If all (lngicg¢c) with c+d+e+f=k+1 occur in Pi,---,P, 
then no extension is needed to satisfy (A) for N=k-+1. If one such combina- 
tion has not occurred write P,41=(%}) and with a finite number of such ad- 
junctions (A) is satisfied for N =k+1. Now if a given i Sk+1 has not occurred 
in some P as an a, adjoin to this P the combination (},) where if » is sufficiently 
large (A) will not be violated, since m must satisfy only a finite number of in- 
equalities. Similarly adjoin (7) to any P in which 1<+1 has not occurred 
as a b. if m is large enough this may be done without violating (A). Hence in 
a finite number of steps we may extend P;, - - - , P,, satisfying (A) and (B) 
for N=k, to P;,---, P; satisfying (A) and (B) for N=k+1. Since we do 
not alter the letters originally in the P’s but merely adjoin new letters and 
new P’s there is no ambiguity in considering the result of a countable number 
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of steps, and this is a set S satisfying (a) and (b). 
If we start with © 


1, 2,3, 4, P. — 27, 2r+1, 
4,3,+ ar, 3 1,0, 3,2, +++, 2r+1, Qr, 


we have (A) and (B) satisfied for k=2. Ask ~, P;, P2, Ps; remain unchanged, 
and the complete set S satisfies (a) and (b). But here both P, and P; take 0 
into 1 and P,P;' and P;P;? displace all letters, thus violating (c). On the 
other hand if we start with Pyp=( 307%"), P2=(¢i3) we have (A) and 
(B) satisfied for k =2. At each stage if some P; does not fix any letter we may 
adjoin (¢) to P; with a sufficiently large c. In this case there is no P such that 
PP?" displaces all letters and (c) is violated the other way. 


APPENDIX II 


Of the Veblen-Wedderburn systems with nine elements, there are, besides 
the finite field, four non-isomorphic systems. In all four the additive group is 
the elementary abelian group of order 9 and the elements may be represented 
as 0, 1, 2, a, a+1, a+2, 2a, 2a+1, 2a+2, where we add elements ita+j by 
taking + and 7 modulo 3. The multiplication tables of the four systems are 
given below. Since 0x =x0=0, 0 is omitted from the table. The product uv 
is in the row beginning with u and in the column under v. The element 2 is 
associative on the left since (2x)y = (x+2x)y=xy+xy =2(xy), but need not be 
in the center (System U). 


R. The group system 


2 

1 

2a 
2a+2 
2a+1 

a 
a+2 
a+1 


a a+1 

2a 2a+2 

2 2a+1 
a+2 2 
2a+2 a 

1 a+2 
a+1 2a 
2a+1 1 


S. 


a+1 
2a +2 
1 
a+2 
2a 
2 
a 
2a+1 


a+2 
2a+1 
a+1 
2a 
2 
2a+2 
1 
a 


=z+1 


a+2 
2a+1 
2a+2 
1 
a 
a+1 
2a 
2 


2 

2a+2 

2a+1 

a+1 
1 


273 | 
| 
1 2a 2a+1 2a+2 
2 a a+2 
a 1 2a+2 a+2 
a+1 2a+1 a 1 
a+2 a+1 1 2a 
2a 2 a+l1 2a+1 
2a+1 2a+2 2 a 
2a+2 a+2 2a 2 
1 2 a 2a 2a+1 2a+2 
2 1 2a oe. a+2 a+1 
a 2a a+1 2 2a+1 
a+1 2a+2 2a+1 2a a 
a+2 2a+1 1 a+1 2 
2a a 2a+2 1 a+2 
2a+1 a+2 2 2a+2 1 | 
2a+2 a+1 a+2 a 2a | 
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T. 3s?=22+1 


a+1 a+2 
2a+2 2a+1 
a+2 1 
2a a 
1 2a+2 
2a+1 2 
2 a+1 
a 2a 


U. 2 not in center 


a+1 a+2 

2a 2a+2 2a+1 a 
a+2 1 2a a+1 

a+1 2a 2a+2 a+2 2 1 
a+2 2a+2 2 2a a+1 2a+1 
2a a+2 2a+1 2 a 2a+2 
2a+1 a+1 1 a 2a+2 a+2 

2a+2 a a+1 2a+1 1 2 a+2 


System R is a group system which may be derived from a doubly transi- 
tive group on 9 letters in which only the identity fixes two letters, as in Theo- 


rems 5.6 and 5.7. The subgroup of elements fixing one letter (the multiplica- 
tive group of the Veblen-Wedderburn system) is the quaternion group. 

In systems S and T the elements 0, 1, 2 are in the center and in S every 
other element 2z satisfies 22=z+1, while in T every other element z satisfies 
z?=2z+1. In system U the center consists solely of the elements 0, 1. In 
S, T, Uit is immediately observed that (aa)a¥a(aa). 

Systems S and T suggest a method of constructing Veblen-Wedderburn 
systems J which turns out to be valid. Given a commutative field F and a 
polynomial with coefficients from F, 2*—rz—s, irreducible over F. Let the 
elements of J be au+5, a, CF, and let addition in J be defined by (au+5) 
+(cu+d) = (a+c)u+(b+d). For multiplication let c(au+b) = (au+5b)c=acu 
+be and c(wv) = (cw)v if a, b, cC F. Also let 2? =rz+s if z=au+b, a¥0. These 
rules, together with the right distributive law, completely determine the multi- 
plication table for J. It may be verified with a certain amount of tedious 
calculation that J then satisfies all requirements of Theorem 5.8 and is a 
Veblen-Wedderburn system. J will not be a field or even a skew-field unless F 
is the finite field with two elements. 

Although the four systems R, S, T, U are not algebraically isomorphic, 
they all define the same projective plane. Systems S, T, U define planes 
#(S), r(T), r(U) isomorphic to each other with B-—B, and each may 
be obtained from the other by an appropriate application of rule (5.12). To 


1 2 a 2a 2a+1 2a+2 

2 1 2a a a+2 a+1 
a 2a 2a+1 2a+2 a+1 2 

a+1 2a+2 1 a+2 2 2a+1 
a+2 2a+1 a+1 2 2a a 
2a a a+2 e+1 2a+2 1 
2a+1 a+2 2a+2 1 a 2a 

2a+2 a+1 2 2a+1 1 a+2 
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map onto take a=1, s=2 in (5.12). r(T)—>7(S), (x, 2y)—>(x, 
The equation (x o d)2 =x(d2) defines a product x od in S in terms of the multi- 
plication in 7. To map onto take a=1, s=2a+2 ir. (5.12). 
[x, y(2a+2) ]—[x, y]. The equation (x d)(2a+2) =x[d(2a+2) | 
defines a product x o d in S in terms of the multiplication in U. There is'no 
isomorphism between 2(R) and 2(S) leaving A and B fixed. To map 2(S) 
onto 7(R) let us take the quadrilateral A, (2), (0, 0), (1, 0) in r(.S) and use it 
as a fundamental quadrilateral to coordinatize the plane. By Theorem 6.3 
this will lead to a Veblen-Wedderburn system and the lines in the plane will 
be determined by equations x=c, y=xm+5. Keeping x coordinates un- 
changed this leads to (x, y)—>(x, x+y), (m)—>(1+-m). In the new ring addition 
is found to be the same as in S, and multiplication u o m is given by the rule 
uo m=u(m—1)+u. The new ring is the system R and so we have mapped 
a(S) onto r(R). Hence we have shown that all four systems define the same 
plane z. 

What is the collineation group of +? If there were a collineation of + 
taking L.. into another line then Theorem L would be satisfied for all 
A, B, M, N on two lines, and hence by Theorem 6.4 every natural ring 
of x would be an alternative field. Since this is not the case, the collineations 
of x leave L,, fixed. Taking 7 as represented by R, the following mappings of 
points are collineations: 


A — (2), 
B.= (1), 
(1) — (0), 
(2) +A, 
(m) —> (2m) (m # 0, 1, 2), 
(x,y) 9), 
A-A, 
(m) — (rms) (r, s fixed parameters 0), 
(x, y) — (x7, ys), 


(m) (m), 
(x, y) > (x+r, 9 +5) (r, s fixed parameters). 


These generate the complete collineation group of which is of order 
10-8-6-4-2-81 =311,040. The collineations induce-a transitive permutation 
group on the points of L., of order 10-8-6-4-2 =3840. In the symmetric group 
on the ten points of L., A, (0), (1), (2), (@), (@+1), (@+2), (2a), (2a+1), 


276 MARSHALL HALL : [September 


(2a+2), this group is the centralizer of the element [A, (0)][(1), (2)] 
[(a), (2a) ][(@+1), (2a+2)][(@+2), (2a+1)]. 


The full projective group of w is the symmetric group on the points of a 
line, and hence is of relatively little interest. 
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ONTHE OSCILLATION OF DIFFERENTIAL TRANSFORMS. III 
OSCILLATIONS OF THE DERIVATIVE OF A FUNCTION 


BY 
A. C. SCHAEFFER 


1. It has been shown by S. Bernstein that if a function which is defined 
in an open interval is infinitely often differentiable (belongs to C”) and each 
of its derivatives is of constant sign then the function is analytic in the inter- 
val('). As a possible generalization of this result Pélya raised the following 
question(?): If im an open interval a function is infinitely often differentiable 
and no derivative changes sign more than a given number of times is the function 
analytic in the interval? The present paper contains an answer to this question 
and to several related questions. These results are consequences of an in- 
equality (Theorem I of the present paper) which relates the magnitude of the 
first derivative of a function to the number of times some higher derivative 
changes sign. All functions considered in this paper will be supposed real for 
real values of the variable. 

In the case in which f(x) is infinitely often differentiable over the interval 
(— ©, «) let VN; be the maximum number of sign changes of f(x) over any 
interval of length a, where a is some arbitrary but fixed positive number. 
Pélya and Wiener(*) made the additional hypothesis that f(x) is periodic 
and found that if N,=O(1) then f(x) is a trigonometric polynomial, while if 
N,=0(k"?) then f(x) is an entire function. Szegé obtained a new proof of these 
results and indeed obtained the sharper theorem(‘) that if Ni<k/log k for 
large k then f(x) is an entire function. Hille(®) has obtained analogous theo- 
rems for a very general class of differential operators and functions which 
satisfy appropriate conditions. 

Let us consider for the moment functions which are bounded and infinitely 
often differentiable over (— ©, ©), but are otherwise completely general. 
It is shown in the present paper that if N,=O(1) then f(x) is an entire func- 
tion of exponential type, and if N,=O(log’k), y<1, then f(x) is an entire 
function(*). In the case in which N; is bounded this is a generalization of 


Presented to the Society, November 28, 1942; received by the editors February 6, 1943. 

(*‘) S. Bernstein, Legons sur les propriétes extrémales - ++, Paris, 1926, pp. 196-197. 

(?) At the Stanford University Symposium, August 12, 1941. 

(*) G. Pélya and N. Wiener, On the osci::~tion of the derivatives of a periodic function, Trans. 
Amer. Math. Soc. vol. 52 (1942) pp. 249-256. 

(*) G. Szegé, On the oscillation of differential transforms. 1, Ibid. pp. 450-462. 

(®) E. Hille, On the oscillation of differential transforms. 11. Characteristic series of boundary 
value problems, Ibid. pp. 463-497. 

(®) A.C. Schaeffer, Oscillations of the derivatives of a function, Proc. Nat. Acad. Sci. U.S. A. 
vol. 28 (1942) pp. 62-63. The results are there stated without proof. 
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one of the theorems of Pélya and Wiener mentioned above, since an entire 
function of exponential type which is periodic is a trigonometric polynomial. 
In the case in which WN, can become infinite as & increases, the results of the 
present paper overlap those of Pélya and Wiener: we consider a more general 
class of functions, but, as indicated above, we allow only a much slower rate 


of growth of Nj. 
2. The proofs are based on the following fundamental result. 


THEOREM I. In an interval a—~L<x<a+L let f(x)EC*, n2=2, and let 
If 
(1) | f’(a) | = 
then f(x) changes sign at least n—1 times in the interval. 


Several inequalities are known for the derivatives of functions which 
satisfy the conditions of Theorem I and have non-negative nth derivatives(’). 
We show by an example that there is no constant A=A(m) such that 
the inequality | f’ (a)| 2AM/L will imply that f‘(x) has » (or more) varia- 
tions in sign. For, with 0<a<1, let 


(1/a)(1 — 2%/a%)", <a, 
r= >a. 


Then if we set {(0) =0 it follows from simple calculations that, over the en- 
tire real axis, f(x) EC", | f(x)| $1, and f(x) has n—1 changes in sign. But 
f'(0) =1/a, which is unbounded as @ approaches zero. 

The following lemma will be sufficient for our purpose, although a more 
precise formulation is known(*). 


Lemma I (S. Bernstein). If in a closed interval of length 2L, f(x) EGC* and 
| f(x)| S1 then there is at least one point in the interior of the interval for which 


| f(x) | < (2/L)»nl. 


One proof, which is undoubtedly known to many, is as follows. Let the 
interval be [1, 1] and let 
nt+1 


g(x) = f(x) — f(x) — 


where Tasi(x) = cos ((m + 1) arc cos x) = 2*%(x — x) -- (x — Xn41), 


(*) E. Landau, Uber einen Satz von Herrn Esclangon, Math. Ann. vol. 102 (1930) pp. 177- 
188. R. P, Boas, Functions with positive derivatives, Duke Math. J. vol. 8 (1941) pp. 163-172. 
R. P. Boas and G. Pélya, Influence of the signs of the derivatives of a function on its analytic 
character, Duke Math. J. vol. 9 (1942) pp. 406-424. 

(*) S. Bernstein, loc. cit. p. 10. J. Shohat, A simple proof of a formula of Tchebychef,, 
Tohoku Math, J. vol. 36 (1932-1933) pp. 230-235. R. P. Boas and G. Pélya, loc. cit. pp. 413- 
414. With more detail the proof in the text can be made to yield more information. 
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x,=cos ((v—1/2)m/(n+1)), is the (w+1)st Tchebycheff polynomial. Then 
g(x) vanishes at +1 or more points, so repeated use of Rolle’s Theorem 
shows that its mth derivative has at least one zero, g‘(a)=0. Since 
Tni1(x)/(x—x,) is a polynomial of degree m with leading coefficient 2", we 
obtain 


n+1 


= f(a) — f(a) 


The summation is bounded by 1 since | T41(x,)| =(n+1)(1—x?)-¥22>n+1, 
and the lemma follows. 

Proof of Theorem I. It is sufficient to consider the case in which M=1 
and the interval is (—1, 1). We suppose that f’(0) S —(10”)?* and prove 
that f‘™(x) must change sign at least n—1 times in (—1, 1). According to 
Lemma I, each closed interval of length 1/(5) lying in (—1, 1) must contain 
at least one point in its interior where 


(2) | f(x) | (20n)*k! < (20n)"n"; k=0,1,2,--++,m. 
The point x in this inequality depends on & and on the interval. 

o(x) = f(x)(20n)—"n-*. 
Then 
(3) ¢’(0) s — 10*20-" = — 
and from inequality (2) it follows that each closed interval of length 1/(S) 
contains at least one point in its interior where 
(4) | | <1; k= 0,1,2,---,%. 

It is to be shown that for k=1, 2, - - - , m there are points Xo, - , 
depending on k, such that 

— k/(Sn) < % < < +--+ < < R/(Sm), 
(5) (— (x,) > 3, yv=1,2,---,h, 
<1, < 1. 


In the case k=n these relations will imply that f‘(x) changes sign at least 
n—1 times, so the theorem will follow. In the case k=1 these relations are a 
consequence of inequalities (3) and (4) where we have x; =0. We now suppose 
that (5) is true for some k, 13% Sn—1, and proceed by induction. 

Relations (5) imply that 0<x,4:—x,<2k/(5n")<2/5. The mean value 
theorem shows that there is a point y; such that 


(x1) — = (41 — to < 
The left-hand side is less than —3+1= —2 so 
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< — 2/(2/5) < — 3. 
Again using the mean value theorem we have (if k>1) 
6 < — = — (y2), #1 < ya < 
from which we conclude that ¢“+”(y2) >3, and so on. Thus there are points 


— k/(Sn) < < y2 << < Year < 


such that 
(— Y(y,) > 3; =1,2,---,k+1. 


Then each of the intervals [y:—1/(5), y:] and ye41+1/(5) ] contains 
at least one interior point where | pietn| <1. We call these points yo and ya+2 
respectively and (5) is true with k replaced by k+1, which completes the 
induction. 


3. THEOREM II. If f(x) EC” in an open interval and there exists an integer p 
such that no derivative of f(x) changes sign more than p times then the function 
ts analytic in the interval. 


Proof. It is dations to show that the function is analytic in the interior 
of every closed interval in the given interval. (In this way we avoid consider- 
ing the possibility of the function becoming infinite near the end points.) Also, 
by a simple transformation the problem is reduced. to the case in which the 
interval is (—1, 1) and |f(x)| S1, -—1<x<1. 

We show by induction that 


(6) | f(x) | —| x] )-*, -1<2x<1, 


for k=0, 1, 2, - where a= 3{10(p+2) } 2¢+2), This inequality i is true for 
k=0. If it is true for some &20 and if |xo| <1, then in the interval 


| x—xe| <(1—|xo|)/(2+1) we have 
(1 —| < (1 —| — + 1))-* < 301 | )-*, 
so, by (6), 
| f(x) | < 3a*k1(1 — | xo| )-*. 
Theorem I with L=(1—|xo|)/(e+1) and »=p+2 implies that 
| (x0) | < {10(p + — | xo| + 1)/(1 —| xo!) 


| (a) | < + — | 


Thus (6) is true for all &; and Taylor’s expansion of f(x) about any point in 
the interior of (—1, 1) converges to the function in an entire neighborhood 
of the point. 
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THEOREM III. Let f(x) EGC” (— ©, ©) and let the number of variations in 
sign of f(x) in every interval of length a be less than B. If 


(7) lim | log | f(2)| < 


then f(x) is an entire function of exponential type. Here a and B are arbitrary 
but fixed positive numbers. 

Proof. From (7) we conclude that there are constants A and B such that 
f(x)| <Ae®l*! for all real x. There is an integer » such that no derivative of 
f(x) changes sign more than p times in any interval of length 2. 

We suppose that for some k 


(8) | f(x) | < 


where 
(9) = e7{10(p + 2) } 


and use induction. Now in an interval of length 2 with center at xo, inequality 
(8) shows that 
| f(x) | < 


Then Theorem I shows immediately that (8) is true for the next higher in- 
teger. Since (8) is true for k =0 it is true for all k. 
Taylor’s expansion of f(x) about the origin then shows that it is an entire 


function of exponential type A, 


which proves the theorem. 

4. The constant A determines the rate of growth of f(z), and in relation 
(9) we have an explicit expression for A in terms of the frequency of the varia- 
tion in sign of f(x). This expression is not the “best possible,” but, at least 
in the case in which f(z) is bounded on the real axis, a “best possible” inequal- 
ity for the rate of growth of f(z) can be obtained by use of function theory. 


THEOREM IV. Let f(x)EC* (— ©, ©) and let it be bounded. If no derivative 
of f(x) changes sign more than p times in any interval of length w then f(x) is 
an entire function of exponential type p, 


| f(z) | 


We prove the theorem with an integer, it will then follow for non- 
integer p. The proof will depend on two lemmas. 
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Lemma II (Pélya-Szegé)(*). Let f(z) be an entire function which satisfies 
f@) f(z) | M, 


then 
| f(z) | 


Lemma III (S. Bernstein) (!*). Under the conditions of the previous lemma 
| | < Ma, ow, 
and the equality holds only if 
f(z) = sin (az + 8). 

Note that if f(z) satisfies the conditions of Lemma II then its derivative 
is also an entire function of exponential type a. Hence if to f(z) we apply 
Lemma III and then Lemma II we obtain |f’(z)| S Mae*!*! which is appar- 
ently a stronger statement than Lemma III. 

Proof of Theorem IV. A function which satisfies the conditions of Theo- 


rem IV is, according to Theorem III, an entire function of exponential type. 
Let it be of precise type a, that is let 


a = lim 7 max log | f(re‘*) |. 
6 


Then f(z) = O(e'*+!l*!) if €>0, and so by Lemma II |f(z)| S Me‘e+lv!, where 
M is the upper bound of f(x). Then letting € approach zero, we find that 
|f(2)| Me*!*!, Let 
M,= sup | 
and 


r= Tim 
to 


We first show that 
(10) r=a. 


Now f(z) is an entire function of exponential type since f(z) is, and f(z) 
is bounded by M;, on the real axis. Then, from Lemma III we find that 
Misi1SaM;, and so rSa. On the other hand, if r<a there is a constant p 


(*) G. Pélya and G. Szegé, Aufgaben und Lehrsdtze aus der Analysts, Berlin, 1925, vol. 2, 
pp. 35-36 and 218-219 (Problems 201, 202). R. Duffin and A. C. Schaeffer, Some properties of 
functions of exponential type, Bull. Amer. Math. Soc. vol. 44 (1938) pp. 236-240, where a more 
precise formulation is given (for functions which are real on the real axis). 

(*°) S. Bernstein, loc. cit. pp. 97-102, G. Pélya and G. Szegi, loc. cit. vol.2 pp. 35 and 218- 
219 (Problem 201). R. Duffin and A. C. Schaeffer, loc. cit. p. 239. In the discussion of the case 
of equality it is essential that the functions be real on the real axis. 
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such that Miy:/Mi<p<a, k2ko. This clearly implies that M,<Ap’, 
k=0, 1, 2,---, for some constant A. Then using Taylor’s expansion of 


f(z) about the origin, we have 


| f(z) | AD pt | z|*/k! = Aerltl, 
0 


which implies that f(z) is of exponential type p<a. This proves (10). 
Let ki<k2< --- be a sequence of positive integers such that 


> a(1 — 1/r). 

Then at some point f+” (x) will be near its upper bound. Let 
| | > (1 — 1/v)Mi,41, 

and let 

= f'*»)(x, + 2)/Mi,. 
Then 4,(z) is an entire function of exponential type @ and it is bounded by 1 
on the real axis, so by Lemma II 
(11) | ¢(z)| 
Also, according to the construction, 

| (0) | =| | /Ma, > (1 — > a(1 — 
Inequality (11) together with Vitali’s convergence theorem implies that 


there is a subsequence of the functions ¢,(z) which converges to an entire 
function ¢(z) with the properties that 


| o(2)| 
and 
|¢’(0)| = a. 
The case of equality in Lemma III then shows that 
¢o(z) = + sin az. 


If a>p then ¢(x) would change sign at least +1 times in an interval 
(—e, t—e) where ¢ is some sufficiently small positive number, and then 
f(x) would change sign at least +1 times in an interval of length z if k, 
is large. This is impossible, so aS and the theorem follows. 

5. In the following let NV; be such that f(x) does not change sign more 
than NV, times in any interval of length a, where a is an arbitrary but fixed 


positive number. 
THEOREM V. Let f(x)EC*(— ©, ©) and let Nx =O(log’? k), y <1. If 
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lim | «| log | f(x) | < @ 


then f(x) is an entire function of order not exceeding one. 
It is sufficient to consider the case in whicha=2. There are constants My) 
and B such that 
| f(x) | < Moe?!*!, 


If xo is any real number then f(x) is bounded by Moee#!* in the interval 
|x—xo| <1. Then in virtue of the conditions of this theorem there is an in- 
teger k such that Nz<k—2, and so Theorem I. gives a dominant for f’(x) 
of the form | f' (x)| < Mye*'*!, — © <x< «. Repetition of this argument shows 
that for k=0, 1, 2,--- 


(12) | f(x) | < om, 


and the object is to prove that the M; can be chosen to increase at a suffi- 
ciently slow rate so that f(x) is an entire function of order no larger than one. 
If y<pu<1 then for large k, 


Ni < &. 
Also, if 1<’<1 and we define m by 
(13) n = [log* &] 
then for large k, 
Nisn < log* (Rk + log* k) < n — 2. 


If & is large (12) shows that f(x) is bounded by M,e%e*!**! in the interval 
|x—xo| <1, so, applying Theorem I to the function f(x) with defined by 
(13), we have 


| (a9) | < 
Mes S 


and after some simplification this becomes 
s 


Thus 


for large Finally, 

= O((R!)*) 
for every ¢€>0, and this shows that f(x) is an entire function of order no 
greater than one. 


STANFORD UNIVERSITY, 
STANFORD UNIVERSITY, CALIF. 
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ON HOMOLOGICAL SITUATION PROPERTIES 
OF COMPLEXES AND CLOSED SETS 


BY 
PAUL ALEXANDROFF 


The purpose of this paper is to find and to study topological invariants 
which connect the homological properties of a space K with those of its closed 
subset A and of the open complement G= K\4A, and thus help to characterize 
from the homological point of view the situation of A in K. 

In the case when K is simply connected (that is when the Betti groups of 
K are zero) the problem is solved by the duality theorems of Alexander, 
Pontrjagin, and Kolmogoroff, which determine the Betti groups of G through 
the Betti groups of A. In the other special case when K is a manifold the first 
duality theorems have been obtained by Pontrjagin [11](*) and Lefschetz [9] 
in 1927-1928. All these results are special cases of the general theory to which 
the present paper is devoted and which gives the very elementary construc- 
tion (of the so-called extension- and intersection-homomorphisms, section 11) 
dominating the whole variety of duality and other situation properties. 

The complete combinatorial theory is given in Chapter I for an arbitrary 
cell complex K and its closed subcomplex A. In Chapter III the same theory 
is generalized for locally bicompact normal spaces K and their closed sets A; 
this generalization is based on an approximation process developed in Chap- 
ter II. Chapter IV deals with manifolds and gives an elementary proof (com- 
binatorial in character) of the Alexander-Pontrjagin duality in its most 
general form. 

All main results obtained are completely formulated in the first four sections 
of Chapter I (sections 11-14). Numerical consequences are given in section 16. 
Section 18 deals with the Phragmén-Brouwer problem, while in section 19 
some quite elementary examples are given as illustration. 

The elementary known facts and notations used throughout this paper 
are systematized in the Introduction; its first part contains the group theo- 
retical material, the second the needed information on complexes. Thus the 
present paper is practically independent of the previous literature on related 
subjects. There are only few references to my paper [1], and each of them 
may be read without reading the rest of that paper. 


INTRODUCTION 
§1. GROUPS AND THEIR CHARACTER GROUPS 
1. Groups, homomorphisms, duality. All groups considered are commuta- 


Presented to the Society, April 24, 1943; received by the editors February 17, 1943. 
(?) Numbers in brackets refer to the bibliography at the end of the paper. 


286 


PROPERTIES OF COMPLEXES AND CLOSED SETS 287 


tive and written additively. In topological groups only closed subgroups and 
continuous homomorphisms are allowed. The difference- (or factor-) group 
of a group X over its subgroup Xo is denoted by X — Xo. 

Let o be a homomorphism of the group X into the group Y; let ¥i:=0X 
be the image of X under o and X,=o—0 (or o~Oy) be the kernel of the homo- 
morphism g¢, that is, the subgroup of all elements of X mapped by ¢@ on the 
zero element Oy of Y. Then we often use the picture 


XD Xo 
We denote by J the additive group of all integers, by 7 the additive con- 
tinuous (bicompact) group of all real numbers reduced modulo 1. 


If B is the character group of the group A in the sense of Pontrjagin [10] 
then A is the character group of B, and we write 


(1.1) A|B. 


The groups A and B are called in this case dual and the relation (1.1) is called 
a duality. 

2. Scalar products. If A|_B, a€A, b€B, then the character b of the 
group A and the character a of the group B map respectively the elements 
a€A and b€B on the same element 


ba = ad CT 
called the product of a and b. 
Other cases in which products are defined are: 
1. When a and 6 are elements of the same (commutative) ring. 
2. When a is an element of an arbitrary group A, while 6 is an integer. 
Then for 


ab = ba =a+at+:--+a (6 times), 
a(— 6) = (— b)a = — (ad), 


and ab=ba=0 for 
The scalar product 


(x-y) = aid; 


of two linear forms x=) a;t; and y=) bit; is defined if the products a,b; are 
defined. 

3. Annihilators. Let X| Y; let AC X and BC Y be subgroups of X and Y 
respectively. If B is the annihilaior of A (in Y), that is, the subgroup of all 
elements y of Y satisfying the relation xy=0 whenever xCA, then A is the 
annihilator of B in X and we write 


XDALBCY, 
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this relation being called an annthilation. 

A fundamental theorem of the character theory is: 

THEOREM 3. The annihilation XDA LBCY implies the dualities 

(X—A)|B, (Y—B)|A. 
4. Conjugate homomorphisms. The fundamental lemma. Let 
The homomorphisms 
cofXintoY; ¢of Y into X 

called conjugate if (ox- 9) =(x- 9), whenever In this case we 


write 


or—if the images ¥;=o0X, X:=@Y and the kernels X»=o—0y, = are 
given— 
XD Xo Gz 


(4.1) 
rar. 


Most of this paper is based on the 


FUNDAMENTAL LEMMA 4. From (4.1) follows 
(where the diagonal line means the duality X;| Y:) and 
Xo| — 


(Y — Vo. 


The dualities (4.3) follow from Theorem 3 and the annihilations in (4.2), 
while the duality X; | Y; follows from Theorem 3 by means of the isomorphism 
Y, ~ X —Xo. Thus we need to prove only the annihilations in (4.2). It is suffi- 
cient to prove the first annihilation 


To this end let Xo, Then 
(x-69) = (ox-F) = 0. 


(4.3) 
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On the other hand, if x0, then there exists a CGY with 
(x-a9) = (ox-5) 0. 


Lemma 4 is proved. 

5. Remark concerning the rational case. Let # be the field of all rational 
numbers, * the corresponding n-dimensional linear space, that is, the group 
of all linear forms with n variables and rational coefficients; the group ®* is 
considered as a group with operators, the latter being rational numbers. Thus 
only linear subspaces are allowed as subgroups. If we understand now by a 
character of #* a homomorphism of #* into R, the “character group” of R* 
is again *, as follows from the definition y(x) =x(y) =(x-y) for any xER", 
yC€R", the scalar product being understood in the most elementary sense. 
Thus the duality relation | turns in this case into the identity and Lemma 4 
still holds but now becomes trivial. 


§2. CONCERNING CELL COMPLEXES 


6. Cell spaces and their subspaces. A cell space (Tucker [14], Kol- 
mogoroff [8]) is a finite set of elements, called cells, satisfying the following 
conditions: 

1. To any cell corresponds a non-negative integer, the dimension of the 
cell (denoted by a superscript: # is an r-dimensional cell or an r-cell). 

2. To any cell # corresponds a well defined cell —# of the same dimen- 
sion 7, and we have —(—?’) the cells and —?’ are called opposite (to each 
other). 

3. To any two cells # and #—' of dimensions r and r—1, respectively, 
corresponds an integer (é":#—"), the incidence coefficient of ¢’ and t’-', under 
the conditions: 


(— = — = — (et). 


It is convenient to suppose that in any pair of opposite cells of the cell 
space K a definite cell is denoted by 4 and the other by —4. Then we put 


= )- 


Let and be two cells, and s<r. We write {> or if 
cells - -- , can be found so that 


If 4 <& we say that & is a face of &. 
A cell space Q is called a subspace of the cell space K if Q is a subset of K 


and 
(a) two cells are opposite in Q if and only if they are opposite in K; 
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(b) the dimension of a cell and the incidence coefficient of two cells in Q 
are the same as in K. 

A subspace Q of the cell space K is called: closed, if t€Q, t’ <# (in K) im- 
plies t’ EQ; open, if t€Q, t’>¢ (in K) implies ¢’€Q. 

7. Chains and boundaries. An r-dimensional chain (or an r-chain) on a 
cell space K with respect to the group X (the “coefficient-group”) is a function 
x’ with values in X defined on the set of all r-dimensional cells of K, 
taking opposite values on opposite cells. The additive group of all r-dimen- 
sional chains on K with respect to X is denoted by L%(X) or by Lz. Chains 
with respect to the group J of all integers are called integer chains. 

The chain taking the value a€X on the cell # GK and the value zero on 
all cells different from +# is denoted by a? (in particular, the integer chain 
taking the value 1 on #, the value —1 on —# and the value 0 elsewhere is 
identified with the cell #). This convention allows one to set for any chain 


x" 
x= at; 


where a; is the value taken by x’ on 4. 
The scalar product 
(x"-y") = 


is defined for two chains x=) af, and y => bf under the hypotheses for 
coefficient-groups mentioned in section 2. In particular the scalar product 
(x*-&) is always defined and equal to the value of the chain x’ on the cell &. 

Let x" be any chain of the cell space K. The (r—1)-chain taking the value 
dey(x":4) on any 6-'CK is denoted by Ax’ and called the A-boundary (or 
the lower boundary) of x* (on K). The (r+1)-chain taking the value 
Dei (x"-4) on &Y'CK is denoted by Vx" and called the V- (or upper) 
boundary of x* on K; sometimes we write Axx’, Vxx" instead of Ax’, Vx" to 
show the cell space on which A, V operate. 

Obviously 


i 1 
) = = , 


and thus for any two chains x’ and y’t! 
(7.1) (Vx"- yrtt) = 


provided that these scalar products have a sense (section 2). 
Let X| X, Ly =Lz(X), =Li(X). Denote by 


He= We = 


the images; by Zx, Zx the kernels of the homomorphisms 


V(of Lx intoLx);  A(ofLx into Lx). 
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The elements of (of Zx) are called r-dimensional V-cycles (A-cycles) ; 
among all elements x’ of Lx (of Zz) they are characterized by the condition 


vx" =0; Ax’ = 0. 
From (7.1) and the Fundamental Lemma 4 follows 


is 
(7.2) 


(7.3) (Lx — Hx) |Zx. 


8. Cell complexes. A cell space is called a cell complex if for each of its 
cells =0, that is ?=0 for every r, i, k. On a cell complex AAx*=0, 
=0 for every chain x’ and therefore Hy The elements of 
Hr, Hx are called bounding cycles (V- and A-, respectively). 

9. Betti groups. These are the groups 


Ve =Vx(X)=Zx—Hx; Ax = Ax(X) =Zx — Ox; 


the first is called the r-dimensional V- (or upper) group and the second is 
called the r-dimensional A- (or lower) group of K with respect to the given 
coefficient-group (X for V and X for A). The elements of Vx (of Ax) are called 
r-dimensional Y-classes (A-classes) of K. Two V-cycles (A-cycles) zj and 2 are 
called homologous to each other on K if they belong to the same V-class 
(A-class); we write in this case zj~z, on K. 

Let us prove the duality 


(9) Vx | Ax. 


We recall from the character theory that from A| A, C| CT, C=A—B fol- 
lows B| A—C. Apply this to the Emmy Noether isomorphism 


(Lx — Hx) — 2x — Hx) ~ Lx —Zx 
and take from (7.3), (7.2) (with r+1 instead of r) 
We get 
(2x — Hx)| @x — Hx), 

q.e.d. 

10. Closed and open subcomplexes. Let Q be a closed subspace of the cell 
complex K and #€Q. Then Agi’ and AgAgi’ = AxAxi’=0; thus Q is a 
cell complex. 


Let Q be an open subspace of the cell complex K. Take any chain x’ on Q 
and define the chain Ex, called the trivial extension of x* over K, by setting 


Ms 


292 P. ALEXANDROFF - [September 


E%x" =x" on Q, E¥x"=0 on K\Q. Then since Q is open on K, we have for 
any 
(Agx = (AxExx 
It follows that 
(Aghox = = 0 


for any EQ, and Q is again a cell complex. 
Thus closed as well as open subspaces of a cell complex are cell complexes. 


CHAPTER I. COMPLEXES 
§3. THE GENERAL THEORY 


11. The extension- and the intersection-homomorphisms. Let Q be a sub- 
complex of the cell complex K and xQ a chain on Q. Any chain on K taking 
on the cells of Q the same values as the chain xQ is called an extension of Q 
(over K). Among these extensions the érivial extension EXxo takes the value 
zero on K\Q. 

If we assign to any chain on Q its trivial extension E%xo, we get the ex- 
tension-isomorphism of LQ into Lx. 

REMARK. If there is no possibility of confusion, we shall identify chains 
with their trivial extensions and thus consider the extension-isomorphism as 
the identical isomorphism of LQ into Lx>Lg. 

Now, let x% be any chain on K. We denote by Jéxk the chain on Q taking 
on the cells of Q the same values as the chain xx. If we assign to any chain 
x’ on K the chain J$xx, we get a homomorphism J§ of Lx into LQ called 
the intersection-homomor phism. 

Obviously, if X| =L(X), then 

x|Lr 
(11.1) Jo 

From now to the end of this chapter K is a fixed cell complex, A is a fixed 
closed subcomplex of K and G=K\A is the complementary open subcomplex. 

It is easily seen that for any chains x4 on A, xG on G, xx on K: 


(11.2) = ExAxs; VExxe = 


(11.3) Wate =Javex; = 


11.4. Therefore, the extension isomorphism maps identically 


A-cycles of A V-cycles of G 
on A-cycles of K on V-cycles of K 


while bounding cycles are mapped on bounding cycles. Thus the extension- 
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isomorphism generates a homomorphism called the extension-homomorphism: 
Ex of Ai into Mx; Ex of Vo into Vx. 


11.5. The intersection-homomorphism maps Y-cycles of K on V-cycles 
of A; A-cycles of K on A-cycles of G (while bounding cycles are mapped on 
bounding cycles) and thus generates a homomorphism, also called the inter- 
section-homomor phism: 


r K r 
Ja Of Vx into Va; Ja of Ax into Ag. 


12. The groups Aku, Vic; Vix, Aox. We write Ox for the zero 
element of both Az and Vx, and use 04, 0¢ in an analogous sense. Now we 
define 


(12.1) Axa = C Ax; Vxc = Exve C (images), 
(12.2) = (Ex) Vax=(Ex) OxCve —_—(kernels), 
(12.3) Vax = JaVx Va; Acx = be (images), 
and prove for the kernels of the intersection-homomorphism: 
(12.4) (a) =Vxe; (Ja) Oo = Ara. 

We prove only the left-hand formula (writing J for J and E for Ef). 

1. If 2 with 4CZG, then Jz’ 


2. If 2° Hj, then 2 with h’ CHR. 
Let us prove 2. We have Je*=Ax;", For any 


— VEx, = (2 — (VExa 
Now 


1 


(VEx, = (JVEx, = (VJE«s = th). 


Hence 
— VEx, )-ts) = 0 for € A 


so that the cycle 2° —VEx;,' is on G. As VEx, CH; the assertion 2 and thus 
the formula (12.4) are proved. 
13. The geometrical meaning of the groups Ax,, Vic; 44:x, Vo:x; Vax» Sox- 
13.1. The subgroup 


Axa of Ax; of Vx 


defined as the image of A (of VG) under the extension-homomorphism has 
for its elements those A-classes (Y-classes) of K which contain A-cycles on A 
(V-cycles on G). 
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13.2. Take all those 
A-classes ha EAs; V-classes EVe 


whose elements bound on K. The resulting subgroup of A (of VG) is the group 
4.x (the group V@.x), defined as the kernel of the extension-homomorphism. 


DEFINITION 13.3. A V-cycle on A (a A-cycle on G) is called extensible 
(over K) if among its extensions over K there are Y-cycles (A-cycles) on K. 


13.4. Then the group Vix (the group AGx) defined as the image of Vx 
(of 4g) under the intersection-homomorphism is the difference-group of the 
group of all r-dimensional extensible Y-cycles on A (A-cycles on G) over the 
subgroup of cycles bounding on A (on G). 

14. The results. From the definition of the groups involved, from (11.1) 
and (12.4) follows by the Fundamental Lemma 4: 


Tr Tr r 
Ag L Vax Va 


(14.1) Ak D Aka L Veo CVE 


Ag DAGk Ve 


and 
(14.2) | (Va Vax); Vo:x | Aor). 


The picture (14.1) contains 
1. two pairs of isomorphisms: the first pair 


An — Aux © Axa; Vo — Ve; 
the second pair 
Ax — Aka © Vx — Vxo Vax, 
2. the pair of dualities 
Axa | Vaxi Aox; 
3. three annihilations: the central annihilation 
Ax L Vo C VK 
and the pair of annihilations 
Aux VoD Vax LAox C Ao. 


In each pair of relations (isomorphisms, dualities, annihilations) the one rela- 
tion is obtained from the other by interchanging A and Y, A and G. 


—— 
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From these relations the first pair of isomorphisms is a triviality, being 
contained in the very definition of the groups involved. 

The dualities in (14.1) and (14.2) are algebraic consequences of the an- 
nihilations and the latter follow from the fact that the extension- and inter- 
section-homomorphisms are conjugate. 

The central annihilation may be brought obviously into the form of the 
following 


REMOVING THEOREM. In order that an r-dimensional Y-cycle (A-cycle) on K 
be homologous on K toa 
V-cycle on G, A-cycle on A 
it is necessary and sufficient that its scalar product with each r-dimensional 
A-cycle on A; r-dimensional Y-cycle on G be equal to zero(?). 


In addition to (14.1) and (14.2) we have 


THE MAIN DUALITY THEOREM. 
r+1 


(14.3) | 
We give to this fundamental duality the form of a pair of isomorphisms 
by means of the following definitions: 


Vax =Va-Vax; = — Aox. 
REMARK. The groups V‘:x, 4G:c can also be defined as the difference 
groups of the groups of all r-dimensional Y-cycles of A (A-cycles of G) over 


the subgroups of all extensible cycles. 
With these definitions we deduce from (14.2) the dualities 


(14. 2") | Va:xi Vox | 
and transform the duality (14.3) into the third pair of isomorphisms: 


+1 r r+1 
(14.3’) = Va:K © Va:k- 


The groups 4’:x, Vax» Ske; Va:x, are called the 
groups of the figure K, A, G. it follows from (14.1)—(14.3) that they are all 
completely determined if we know for instance the groups Ax, A4, Ax, (for all r). 

In Chapters II and III the definitions of the extension- and of the inter- 
section-homomorphisms as well as the definitions of the groups of the figure 
K, A, G are generalized to the case of an arbitrary locally bicompact normal 
space K and its subsets: the closed set ACK and its open complement 
G=K\ A. We show in these chapters that all relations (14.1)—(14.3’) hold in 
that general set theoretical case. 

(?) I am indebted to L. Pontrjagin who first formulated the Removing Theorem and many 


times pointed out to me (as did also L. Lusternik) the desirability of proving it under the most 
general conditions. 
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In particular, the duality theorem (14.3) (or the equivalent pair of iso- 
morphisms (14.3’)) is the definitive generalization of the duality theorem of 
Alexander-Kolmogoroff: If the space K is simply connected in the dimen- 
sion r (that is, if one and therefore both of the equivalent conditions Az =0, 
Vi =0 hold), then obviously A4.<=A4, Hence, if K is simply con- 
nected in the dimensions r and r+1 then (14.3’) turns into the Alexander- 
Kolmogoroff isomorphisms 

r+1 


Let us return to the combinatorial case, that is, let K be a cell complex, 
A a closed subcomplex of K. Taking as coefficient-group the group I» of all 
integers reduced modulo m and denoting by Ax(m), A4(m), Vikg(m) 
and so on the corresponding groups (of K, of A, of G, of K, A, G) we see 
by the first pair of isomorphisms that the order of the group A4(m) is equal 
to the product of the orders of Ax,4(m) and Aj.x(m) and thus is completely 
determined by the last two groups. Now by a theorem of M. Bockstein [6, 
p. 373] all Betti groups of a complex K are completely determined by the 
orders of the groups Axk(m) taken for all m and all r. Thus the groups A 
are determined by the groups Ax,4(m) and Aj.x(m) (taken for all m and all r). 
An analogous result holds for Vg, while Az is determined by the Ax,4(m) 
and Vig(m). 


14.4. Hence if K is a cell complex, A a closed subcomplex of K and G=K\A, 
then the groups A, and AG (and all groups of the figure K, A, G) for any r and 
any coefficient-group are completely determined by the groups Ak(m), Ax,4(m) 
and Ai4.xn(m) taken for all m and all r (while the groups Ax are determined by 
Aka(m) and Vizg(m) taken also for all m and all r). 


In Chapter III, section 40, we prove that Theorem 14.4 still holds if we 
understand by K and A topological images of (finite) polyhedrons (in the 
terminology of [4, p. 128]). Thus for topological images of finite polyhedrons 
the homological situation-problem may be considered as being completely solved. 


REMARK. The third pair of isomorphisms is given for the first time in 
my notes [2]. 

15. The operator VE{ and the proof of the third pair of isomorphisms. 
To any V-cycle 2 on A corresponds the Y-cycle VE%2, on G which by its 
very definition bounds on K. Let us prove: If % is extensible, then VE#d, 
bounds on G. By supposition there exists a V-cycle 2” on K such that 24 =J%2z" 
and therefore (we write E instead of E#) 


r r Kr 
= Ex +Jez, 


r r Kr 
= 


r Kr 
= — VJez. 
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As Jkz* is a chain on G, VE, bounds on G and our assertion is proved. 

Thus the operator VE produces a homomorphism of V's:x into Vaz. We call 
this homomorphism again VE and prove that it is an isomorphism of V4-x« 
onto 

To prove that VE maps V4i.x on the whole V@'¢ take arbitrarily 


so Ese Vax. 
We have to find such a Y-cycle % on A that 2°'—VEz, bounds on G. 
By supposition there exists a chain x” on K bounded by 2" and conse- 


quently 
Kr K r Kr 
Vasax =JaVux = Sate = 0; 


hence 2, = Jix" is a YV-cycle on A and x4 =x'—EJ%x" is a chain on G with 


r+ 


(15.1) = Vax — = oo VEza. 


It remains to be proved that VE maps V4:x on V@x isomorphically. To 
this end we show that if for a certain 


ta ba Vax 
the V-cycle VE24 bounds on G then there exists such a Y-cycle 2” on K that 


2, = Ske. 
By supposition 


VE%,=Vxe, Le. 
Therefore 
V(Ess — xa) = 0, 
so that 2° = Ez,—xG is a V-cycle on K with 
Jas = Ja Et, — = 84, 


q.e.d. 

REMARK. In an analogous way to any A-cycle %*' on G corresponds the 
A-cycle AE%2G*! on A bounding on K, and the operator AE% produces an iso- 
morphism of the group A@g onto the group Aj... 


§4. SPECIAL CASES 


16. Numerical relations. In this section the coefficient-group is either the 
field I, m being a prime, or the field 9 of the rational numbers, the latter 
being treated according to the remark of section 5. Now all dualities turn 
into isomorphisms. 

Let p denote the rank of a group of linear forms with respect to the given 
coefficient-field. We put 7x =p4z =pV (Betti numbers) and 


P. ALEXANDROFF 


= pVax = pAxa; wxo = = pVxe; 

r Tr r r r 
Wa:K = = pVa:K} = pVa:x = pAa:x: 


Then by the first pair of isomorphisms 


r r r r r Tr 
(16.1) = + Tq = + 


while by the second pair of isomorphisms 
(16.2) wx = + 
or 
= (ma - + (we wo:x). 


Using the third pair of isomorphisms and changing r into r+1 we get 


r+1 r+1 r+1 


r 1 
(16.3) —(m — wa:x)s 


or 
r+1 


(16.4) ro = max + (wx — 

In the special case when K is an n-dimensional (orientable) manifold an 
equivalent formula has been given by Pontrjagin; in this case by the Poincaré 
duality (see section 42 below) we have 1g''=2%-"-' and (16.4) can be written 
as 


which is the Pontrjagin formula [11, p. 449, Theorem IT]. 

17. Application to regular components. Now suppose that K is a homo- 
geneously n-dimensional cell complex (that is, there are in K no cells of a 
dimension greater than n while each cell of a dimension less than m is a face 
of an n-dimensional cell). An (n—1)-cell #"-' is called regular if there are in K 
two cells # and @ such that #*~' is a face of +f and +& and of no more 
cells of K. A (finite) sequence ff, - - - , # of n-cells is called an r-sequence if 
in it any two subsequent cells have in K a common face of a dimension at 
least r—1. Two n-cells of K belong (with all their faces) to the same r-compo- 
nent of K if they can be connected (in K) by an r-sequence. The cells common 
to two different r-components of K are of dimension at most r—2. A complex 
is called r-connected if it consists only of one r-component; an m-connected 
complex K is called an n-dimensional pseudomanifold if all of its (%—1)-cells 
are regular. There are orientable and non-orientable m-dimensional pseudo- 
manifolds: the group Vx(J) of an n-dimensional pseudomanifold K is always 
cyclic, infinite if K is orientable, and of order 2 if K is non-orientable. It 
follows that if the m-dimensional cell complex K is in any way decomposed 
into a certain number of n-dimensional pseudomanifolds no two of which 
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have a cell of dimension at least »—1 in common, then the group Vx(J) 
is the direct sum of as many infinite cyclic groups as there are orientable 
pseudomanifolds and of as many groups of order 2 as there are non-orientable 
pseudomanifolds in our decomposition. 

Let K be again an arbitrary homogeneously n-dimensional complex; de- 
note by A,_; the subcomplex of K consisting of all (7 — 1)-dimensional irregu- 
lar cells and of all faces of these cells. The m-components of the open subcom- 
plex Gt=K\A,- are called regular components of K (we slightly change 
here the terminology of [4, p. 190]). The decomposition of G,_: into its n-com- 
ponents (the regular components of K) is a decomposition of the kind just 
mentioned and therefore the number q(K) of the regular components of K is 
equal to the n-dimensional Betti number module 2 of Gr», while the number 
go(K) of the orientable regular components of K is the ordinary n-dimensional 
Betti number of G,.1. Denoting as usual by 7’ the ordinary Betti numbers 
and by 7°(2) the Betti numbers modulo 2 we get, using (16.4): 

17.1. For any homogeneously n-dimensional complex K: 


(17.1) g(K) = + go(K) = + 
In particular, if all (n—1)-cells of K are regular, then 
(17.11) q(K) = rx(2);  go(K) = ax. 


This is the case with the complex G=K\A if the closed subcomplex A CK 
contains all irregular (n—1)-cells of K. Therefore, applying again (16.4), we 
get: 

17.2. If the closed subcomplex A of the homogeneously 2-dimensional 
complex K contains all irregular (n —1)-cells of K, then 


(17.2) q@= wa:x(2) + — wxa(2); go(G) = 


If A is moreover (m—1)-dimensional, then 
(17.3) q(G) = wax(2) + 9x(2);  qoG) = + 


Now we call any cell of K regular if the n-dimensional Y*-group of its 
open star (with respect to the group J) is infinite cyclic. This definition agrees 
for (n—1)-cells with the previous one. If A contains all irregular cells of 
dimensions at least r then the r-components of G are identical with the regular 
components of G. Thus for r=0, that is if A contains all irregular cells of K, 
the ordinary components of G coincide with the regular ones and their number 
is still given by (17.2) and, in the case of an (n—1)-dimensional A, by (17.3). 

REMARK. If no special hypotheses on the closed subcomplex A are made, 
then the number of components of the open subcomplex G=K\A depends 
upon the homological properties of the mutual situation in K of two subcom- 
plexes: the given subcomplex A and the subcomplex A, of all irregular ele- 
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ments of K. Even the most elementary examples show that the features of 
this mutual situation may be very different. 

18. The Phragmén-Brouwer theorem. Let A be as always a closed sub- 
complex of the cell complex K, G=K\A and 


= A; and A; closed in A, 
= A\A1, Qe = A\Az; Gi = K\Ai, G2 = K\Az, 
Ag = As. 


Denote finally by E4.x, i=1, 2, the homomorphism of VQ, into V4.< which 
assigns to each element 2’ of VQ, the class 3°€V4.« containing the element 
Ez". We shall prove the isomorphism 

Note that the elements of the group (Eh :xVG,) \(E4:xVQ,) can be defined as 
being those classes 3°EV‘4:x of V-cycles which contain both cycles on Q, and Qz2; 
the elements of V@.¢,/\VG.c, are V-classes of G whose elements bound both 
on Gi and Go. 

The isomorphism (18.1) is realized by the operator VEE: let 35 €(Ei:xVoQ,) 
(\(E4:xVQ,). Take on Qi. Then is equal to zero on Ai, thus 
bounds on G;. Consequently ail elements of the V-class VE{;3€V@"' bound 
on G,. Taking 25 €35 on-Q2 we see in the same way that all elements of VE%% 
bound on G:. Hence the isomorphism YE% maps the left-hand member of 
(18.1) into the right-hand one. To show that this mapping is on the right- 
hand member of (18.1) let 


so = te \ Vener 
As 2%! bounds on G;, i=1, 2, it is 4! =Vx, with x, on G;. Therefore 2 =J* x; 
is a V-cycle on Q; and by (15.1) (with x4=x,—EJ%xj) we have: 


Ar r+1 
E 4G 


thus 24 €45, and 3 contains cycles on Q; as well as cycles on Qs, q.e.d. 
Let K be simply connected with respect to r; then the isomorphism (18.1) 
is replaced by 
r r+1 r+1 
Now 
Aa > Ada, Vacs S Vas 
therefore 
Aa (Ada, + Maas) L (Waa, Vac.) Va 


(the sum at left may fail to be direct) and 
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(18.3) — (Ada, + Ada) | 
To any A-cycle 2” on A corresponds the A-cycle 


=— on Ao = Az 


(where A operates on A). This correspondence generates a homomorphism 
(the “Naht-Homomorphismus” of [4, p. 292]) of 44 onto A%..4,04%,4, with 
the kernel A44,+A44, (the proof is easy and is given in [4, pp. 289-293]). 
Thus 


(18.4) = Ulan + 


From (18.2)—(18.4) follows a duality which we call 


THE PHRAGMEN-BROUWER THEOREM. 

(18.5) (Agar | Vere,)- 

19. Examples. 

1. The elements of the complex K are: a triangle, its edges and its ver- 
tices, thus the cells +7, +#, +, i=1, 2, 3, with obvious incidence coeffi- 
cients. The open subcomplex G consists only of +#, the closed subcomplex A 
of the remaining elements of K. The operator VE% assigns to the V-class 
consisting of the V-cycles #, 4 on A the V-class 22€V% the only 
element of which is #. The groups V4(J) and V3(J) are both infinite cyclic 
(duality theorem of Alexander-Kolmogoroff). 

2. Consider the circular ring in the plane decomposed into the cell com- 
plex K whose elements are: +#, +#, +#, i=1, 2, 3, with the incidence 
coefficients 


(i) = =1, =0, =1, = - 1, 


= 0 if i, 7 = 1,2. 


The open subcomplex G contains only the cells +# and A=K\G. All groups 
of the figure K, A, G are determined by their ranks: 
0 0 
WK > WKA = 


1 1 
TK WKA 


2 2 2 
WK = TKA = TA = Ta:K = TKE = 0; 

3. The elements of K are +#, +#', + with the incidence coefficients 
(#:41) =2, (#:2) =0; the elements of A are +#' and +? (the cell complex K 
is a cell decomposition of the projective plane; then A is a line on this plane). 
The group A}.« consists of all elements of the form +2n¢! with an arbitrary 
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integer n, while the elements of A3.,=A% are of the form nf; thus A}. and 
A3.x are both infinite cyclic. 
4. The cells of K are: +4, +8; +21; +# with the incidence coefficients 


= (tt) =2; =0 
(two projective planes intersecting in a line); let the cells of A be +4, +2, 
+2. The coefficient-groups are I for V and 7 for A. Then Vx is the direct sum 


of an infinite cyclic group (generated by 4) and of a group of order 2 (gener- 
ated by #—#). Correspondingly A; ~7+J, and 


2 2 2 2 2 
Vec = Vac Axa Vak = Va Ip. 


The geometric meaning of the annihilation Aj D Ak, is in this case 
particularly simple. 

Now let K be the same, while the cells of A are +#', +/ (that is A is the 
intersection line of the two planes). Then V4.c=V4 =I, Vig=Vi=I+h, 
Vz ~I+TI, while V%.x (isomorphic with V}.x) is the infinite cyclic group gen- 
erated by 2(@—8&). 

5. Let K be the three-dimensional (continuous) polyhedron composed by 
the interior and the surface of an ordinary anchor ring. Let A be a ring-shaped 
body (homeomorphic with K) lying in the interior of K and going twice around 
K. We suppose moreoyer that A is decomposed into the simplices of a certain 
closed subcomplex A of a simplicial decomposition K of K; then T'=K\A is 
decomposed into the simplices of the open subcomplex G=K\A of K. The 
coefficient-groups are J for A and 7 for VY. Then 
(the equator of K generates the group Ajx; taken twice it bounds on G). The 
group Ak, is not a direct summand of the group Ak. 

6. Let K be a complex which arises when we identify the two poles of an 
ordinary two-dimensional sphere. A is composed of two parallel circles of 
this sphere. It follows from (16.4): 


= + — = 2+1-053. 

7. In addition to the well known elementary examples illustrating the 
theorem of Phragmén-Brouwer we illustrate the formula (18.1) by a torus 
surface K on which two meridians A; and Az form the complex A; the whole 
figure is considered in a cell decomposition in which the two meridians 
A, and Az, oriented in opposite senses, are cells ¢| and 4. Then 4 —& is a V-cycle 
on A extensible over K; therefore | and 4 belong to the same class 34€Vi:x; 
the group (E4.«VQ,)\(E4:xVQ,) is generated by this 3! and is infinite cyclic. 
The group (Vé.¢,\Vé.c,) ~I is generated by #—4, where the cells 4 and & 
correspond to the two domains into which the ring surface is decomposed by 
the meridians A; and A». An open plane circular ring K and two radial seg- 
ments A, and A; on it could be treated in the same way. 
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PROPERTIES OF COMPLEXES AND CLOSED SETS 


CHAPTER II. SPECTRA 
§5. CELL SPECTRA 


20. Projections. Let K, and Kg be two cell complexes. Let there be given 
for each dimension r a homomorphism of of the group L;(J) of all r-dimen- 
sional integer chains of Kg into the group Lj(J) of all r-dimensional integer 
chains of K,. We suppose moreover that these homomorphisms commute 
with the operator A, that is, that for any (integer) chain x, on Kg 


(20.1) = 


Under these conditions the system of the given homomorphisms af is called 
a projection of the cell complex Kg into the cell complex Ka. 

Let A.C K., Kz be closed subcomplexes of K., Kg respectively. We 

put G.=K.\Aa, Gs=Ks\Ag and write systematically Lj, Li, Li, for 
Ka *, and J%, +++ for Jis, 

Any chain on Ag, Ag being a chain on K., Ks, we say that the projection 
af is cogredient (with Ag, Aq) if every (integer) chain of Ag is mapped by of 
onto a chain on A,. As for xg on Ag, x, on Aq, the boundaries Axg, Ax, are the 
same on Ag, A, and on Kg, Ka, the projection ow of Ks into K. (cogredient 
with Ag, A.) generates a projection a8 of Ag into Az. 

Now define for any chain xf, on Gg: 


a Br 


(20. 11) = J 


Then whatever be the chain xg on Kg, J2,08xs=Ji,08Ji,xs. We use this 
remark in 


a Br a Br a r a 6s r 
= Aa = = J = J 
B1 r B1 r 


where Aq, Ag, Agi, Ag: operate on Ka, Kg, Ga, Gz. 

20.2. Thus a projection af of Ks into K, cogredient with As, Aa generates 
a projection of of Ag into Aq (identical on Ag with of) and (by (20.11)) a 
projection af} of Gs into Ga. 

21. Conjugate homomorphisms. The projection of of K, into K. assigns 
to each chain #3=) a5; on Kg (with respect to a given coefficient-group %) 
the chain w%%, =) joo, on K (with respect to the same coefficient-group 
%) and thus produces for any r a homomorphism af of the group Z,=L$(%) 
into L,=L*(Y) which will still be called a projection. The conjugate homo- 
morphism 23 of Li, =L(%) into L3=L;(%), where %|%, assigns to any chain 
x, the chain defined by 


(21.1) (xg = 


whatever 4© Ks. The homomorphisms 7% commute with V as is seen by 
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arr ar r r Bir r Br 
= = ) = ) 
r B r+ a r+l 
= (V%a'Dals = ). 
Moreover, if of is cogredient with As, Aq then for a chain x,, on G, and any 
= = 0, 
so that 73 maps Lj, into Lj, and thus produces the homomorphism 7% of Li, 
into L, identical with 7g on Li, CLy and conjugate with of}. 
On the other hand we have the homomorphism 738 conjugate with of} 


in the form 
a fr ar 


B r 

Xa0 = J Xa0 for any © 

We collect all we have just said about the cogredient projections a in the 
following formulas: 


a 


(21.2) 


where 


(21.3) 8 8 al a 


1 a 


0 8 ad a 
{ = Sons, 


and af, a, a8} commute with A, while 7%, 1%, 7, commute with VY. There- 
fore the homomorphisms of, of, wf; map Aj, Ajo, Ag respectively into 
Mix, While 7%, ™1 Map Va, Vaor Var into Vg, Veo» Var: 

a Vv. Va Ae a v. 
Ves As LVs0 Ago Ver Agi 


From the definition of a, - follows furthermore: 


22. Cell spectra. Two projections of and w/’ of Kz into K. are called 
homologous to each other if they produce the same homomorphism of Ag 
into Aj. 

Now let there be given an unbounded partially ordered set of cell com- 
plexes K.; we write 8 > a if in this set Ks follows K.; “unbounded” means that 
to each two elements of the set a third element of the same set can be found 
which follows on each of the two given elements. 

Suppose that for any 6>a a finite number of “allowed” projections af 
of Ks into K, is given in such a manner that: 
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1. Any two allowed projections of Ks into K, are homologous. 

2. If y>8>a and a}, of are allowed projections then the projection 
wy = of K, into K, is allowed. 

Under these hypotheses the partially ordered set of the cell complexes K. 
with their allowed projections of is called a cell spectrum and denoted by 


(22) K = {Ka, @a}. 


A cell spectrum (22) determines two group spectra [1, pp. 58-62] or 
homomorphism systems [13, pp. 668-669]: the inverse group spectrum 
(A‘,, @&) and the direct group spectrum (Vj, 7), the homomorphisms of the 
two spectra being conjugate. Therefore the limit groups (see section 23) 
called the Betti groups of the cell spectrum K, 


Ax = lim inf @2); = lim sup 


are dual. 

Now let the closed subcomplexes AgC Kz be given in such a way that 

1. each allowed projection of is cogredient with As, Aa; 

2. each two projections of As into Aa; of Gs into Ga, gen- 
erated by allowed projections of, w/° of Kg into Ka, are homologous to each 
other. Then the given cell spectrum (22) is called cogredient with the subcom- 
plexes A, and G,; such a spectrum defines the cell spectra 


A = Geo}; G= wa} 


with their Betti groups 
= lim inf (Azo, @a0), = lim sup (Wao, 
AG = lim inf (At, = lim sup (V1, a1) 
and the dualities 
Va; Vo. 


§6. THE EXTENSION AND THE INTERSECTION HOMOMORPHISMS AND 
THE GROUPS OF THE FIGURE K, A, G FOR CELL SPECTRA 


23. Lemmas on group spectra. In this section “spectrum” means “group 
spectrum.” Let (U., 7%) be a direct spectrum with the limit group U. The set 
theoretical sum of all groups U, is decomposed into bundles or classes of 
equivalent elements, where two elements u.z@U. and usCUsz are called 
equivalent if there exists a y>a, 8 such that 1u.=7ug. These bundles are 
the elements of the limit group U. To get the sum of u={u.} GU and 
4! = {ud }eu take u.Cu, ug Eu’ and any y>a, 8. Then u+u’ is defined 
as the bundle containing %u.+7ug (see [1, p. 59]). Let (U., #8) be an in- 
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verse spectrum. The elements of the limit group U=lim inf (U., 8) are 
“threads,” that is systems a= {#.} of elements #.€U, such that: 

1. the thread @ contains one and only one element #@, of each group U,; 

2. if and B>a then = 

The sum of two threads a= {a} and @’={#/ } is the thread {#.+a/ } 
(see [1, p. 58]). 


Lemma 23.1 (obvious). If in an inverse spectrum (Ua, of) the subgroups 
are given in such a way that &U U a0 for B> a then the groups Uao 
with their homomorphisms of =of form an inverse spectrum and U)=lim inf 


of) is a subgroup of U=lim inf (U4, a). 


If in a direct spectrum (U., 7) the subgroups U.oC U, are given in. such 
a way that Uso, then the Uso with their homomorphisms 1) 
form a direct spectrum (U.o0, 7). Any bundle of this spectrum is contained 
in a bundle of (U., 7) and any bundle of (U., 7%) contains at most one 
bundle of (Uao, 7%). Therefore the group Us=lim sup (Uao, 70) is mapped 
isomorphically on a subgroup of U=lim sup (U., 7g) and can be identified 
with this subgroup(’). 


LEMMA 23.2. Under the previous hypotheses the group 


= Us Va:ao = Ua — Uao 


is mapped by 
into = Ue into = Us — Uso 
and (*) 
U — Uo = lim inf — Uo = lim sup (Ua:an, 


Let us sketch the proof of Lemma 23.2. To prove the left-hand as- 
sertion we take any element of U—Uo, that is, any class of threads 
a={a.}€U congruent with respect to the subgroup UCU, and as- 
sign to each thread #={u,} belonging to this class the thread {u.} 
Elim inf (0.—Uao, of) where This correspondence is 
easily seen to be an isomorphism between U— UJ, and lim inf (0.—Uao, a). 

To prove the right-hand assertion define the group U’ as follows. Call the 
elements u_ and ug of the set theoretical sum of the groups U, equivalent with 
respect to Uy if a y>a, B can be found such that 2%u_.—mugE Uy. The set 
theoretical sum of the groups U, falls now into classes of elements equivalent 
with respect to U» and these classes form the group U’ (we define the sum of 
two classes u and wu’ as follows: take u.Cu, ug Eu’ and y>a, 8; define u+u’ 

(*) In cases when among all isomorphisms between two isomorphic groups a well defined 


isomorphism is chosen in some “naturai” way we often replace the sign of isomorphism by the 
equality sign and consider the given groups as being identical. 


= 
| 
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as the equivalence class containing r%u.+7%ug ). The group U’ is easily seen 
to be isomorphic with both U— U, and lim sup (U,— Uso, 7). 


LemMA 23.3. Let two direct spectra (Ua, pg) and (Va, 0%) and two inverse 
spectra (U., and (Va, be given [the elements U. and (Ua and Va) 
of these spectra correspond to each other in (1-1) manner]. For every alet a 
homomorphism 


fa of UaintoVa; faofVa into Ua 
be given in such a way that for every B>a and ta Ua (fa€ Va) we have 


(23.31) = fapsta; = Fab ais. 
Then we obtain a homomorphism 
f of U = lim sup (Ua, pa) into V = lim sup (Ve, os); 
of V = lim inf (Ve, #2) into U = lim inf (U., 


by putting in correspondence to each 


bundle u={ua} thread i= 
the bundle v=fu the thread u=fo= {fia}. 
containing for a u.zCu 
the element fata. 
The proof of Lemma 23.3 may be left to the reader as well as the proof of 


the following 
LEMMA 23.4. Under the hypotheses of Lemma 23.3 


and (taking into account Lemma 23.1) 

(23.42) lim sup (faUe, os) = fU; inf (f.V., = 
The same is true for the kernels of the homomorphisms fz, f, fa, f: 

(23.43) ps(f Oz) fo Op; 0s) fa Oe 

(we write 0, 0g, 02, Og for the zero elements of Va, Vs, Ua, Us), and 


(23.44) lim sup (fe Oa ps) =f Ov; _liminf (f. Oe, =f 


Finally we have 
Lemma 23.5. If, still under the hypotheses of 23.3, we have for every a 


z 
| 
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U Ut. 
The proof follows from the definition of the scalar product for limit groups 
[13, p. 670, formula (6.2) ], namely: 
For any u= {ua} lim sup (Uz, pg), lim inf (V., 78) 
(23.51) (u-%) = (tata), 


where uq is chosen arbitrarily in u. 
The formulae (23.51) and (23.41) lead to 


(fu-d) = (fatta'Ha) = = (u-fo), 


which proves Lemma 23.5. 

24. Simplified definitions of the V-groups of a cell spectrum. Lemma 23.2 
allows the definition of the group Vx, where K is a cell spectrum K = {Ka of } 7 
to take the following simple form. Two V-cycles % and 2% of K., Kg, respec- 
tively, belong by definition to the same V-class of K if there exists a y >a, 8B 
such that 7%2,—2%2, bounds on K,. These V-classes are the elements of Vx. 
The sum of two V-classes 2” and 2’" is defined as follows: take 2,€2’, 2g €2” 
and y>a, 8. Then 2"+2"" is defined as the V-class containing 12, +7425. 

25. The extension and the intersection homomorphisms. The groups of 
the figure K, A, G. By the formulae (21.5) and (21.6) and the Lemmas 23.3 
and 23.5 the homomorphisms E%*, E’, J%, J, generate the homomorphisms 
Ei, E%, J%, JE of the corresponding limit groups according to the figure: 


K G 
(25.1) | > . 
Va Aa Ac Ve 


| According to Lemma 23.4 (and Lemma 23.2 used in (25.3)) define: 


Vax = JaVx = lim sup (Va,x,, ™50); 
Aka = = lim inf (Ax 4.) 
r r r 8 
Acx = Johx = lim inf (A¢_x,, a1); 


r Ger r a 
= ExVe = lim sup (Vx,<¢,, 7s); 


Aa:k = (Ex) Ox = lim inf 
= (Ex) Ox = lim sup 


(25.3) r r r r ad 
= Va — Vax = lim sup x,, 780); 


r r r 
Ae:x = Ag Acx => lim inf (Ag,:x,, a1). 


Then by Lemma 23.4 and the formula (12.4): 


308 


PROPERTIES OF COMPLEXES AND CLOSED SETS 


K.-— r a r 
(Ja) Os = lim sup (Vxaca, ™8) = 
(25.4) 


(JG) Og = lim inf = 


From these formulae we get literally as in section 14 the relations (14.1)— 
(14.2) for the spectra K, A, G. 

. 26. A direct definition of the groups of the figure K, A, G. Since any 
A-cycle (A-homology) on A, is a A-cycle (a A-homology) on Kz, to any thread 
{ Bio} corresponds the thread { where 3) is com- 
pletely determined by 3%,C 3%. This thread 2% is the thread E424. 

Since any V-cycle (V-homology) on G, is a Y-cycle (V-homology) on K., 
any V-class (section 24) 22€Vq is a subset of a well determined Y-class 
tk €Vx, and this 2% is the V-class 

Let { 3%} Then { 3%} is a thread and this thread is the 
thread AG. 

Let 2 = GVx. Then all J3,2, are elements of the same Y-class 24 
and this V-class is 

Thus we get the following direct definitions of the groups of the figure 
K, A, G: 

The elements of are threads = { 32} €Ax containing threads 
2, = { 350} €Ay in the sense that each A-class 3%, is a subset of the corre- 
sponding A-class 3}. 

The elements of Vg are V-classes 2g EV containing as subsets V-classes 
2g 

The elements of VixCV‘4 are V-classes 24€V4 containing among their 
elements extensible Y-cycles, that is, V-cycles 2% on Aa of the form 25 = J%27, 
* where z, is a Y-cycle on For any element 2 of 2, Vx there is a B>a 
such that is extensible. 

The elements of AGxCA@ are “extensible threads” {351} that is, 
threads whose elements 33, are all extensible (that is of the form 33,=J2,%, 
where 

The elements of are threads 24 = { bounding on K in 
the sense that the elements of each A-class 34, bound on Kg. 

The elements of V%.nxCV@ are V-classes 23€VG@ bounding on K in the 
sense that 2G is a subset of the V-class 2g =0 representing the zero of the group 
Vx. In other words, for any 2, 2% there is a 7g12,, bounding on Kz. 


DEFINITION 26.1. The V-cycle to on Aq and the V-cycle tp on Ag belong 
to the same extension class (with respect to the cell spectrum K) if there exists a 
y>«a, B such that — is extensible. 


DEFINITION 26.2. The threads 2%={3%,} and belong to the 
same extension class if for each a the A-class 34 —i ts extensible (that is, is of 
the form — = With 


4 
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Now the elements of V4.,=V4—Vx can be identified with the extension 
classes of Y-cycles 2. To get the sum of two extension classes 2’ and 2’’ take 
as always %»€2", 2 €2’’, choose any y >a, 8 and define 2"+2’' as the exten- 
sion class containing 7°$%9-+7o2; the group V4.c can be defined as the 
group of the extension classes of r-dimensional Y-cycles on A, (a variable) 
with this addition. 

The elements of the group are extension classes of threads 2€ Aj. 

27. Proof of the third pair of isomorphisms. Let 


The V-cycle VE%z,, is on G, and bounds on Kg. 
27.1. If 2 and 2% belong to the same extension class, that is, if for a cer- 


tain y>a,B 
£0 r 
(27 . 1 1) 70280 = 


where 2, is a V-cycle on K,, then 


(27.12) 200 — 30 Hy 


(Hz, is the group of all r-dimensional V-cycles on G, bounding on G,). 
To prove (27.12) we first deduce from (27.11) and section 15: 
On the other hand 
200 VE} = Es Ey 50300) 
and is on G,. Thus 
70 adr 


(27.14) 200 — VE) 


and in the same way 
70 r 


(27.15) — VE, Hen 
From (27.13)—(27.15) follows (27.12) and therefore (27.1). 

Thus the operators VE%° generate a homomorphism—we call it VE—of 
Vix into 

27.2. The homomorphism VE maps V's:x onto Vag. To prove this let 
and We are seeking a and a Y-cycle % on Az 
such that ; 


(27.21) VEs 250 Hos 
From 2i'€2t'Cvete and the definition of VG" it follows that there exist 
a B>aand a chain x, on Kg such that As is on Gz, the chain 


} 
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Jhoxh = is a V-cycle on Ag, while x3— is on G and 
— Ep = Vap — VEs = — 290 


from which follows (27.21) and thus (27.2). 

27.3. VE maps Vi:x on Voix isomorphically. Suppose that the element 
2,4 ©Vi-x is mapped by VE on the zero element of the group Vg. We shall 
prove that the extension class 24 is the zero element of the group V4:x. Take 
2% 24. It follows from our hypotheses that there exists such a 8> a that 


Since 

200 — VEp © Hoi 
(see (27.14)) and 7$02%9€z4, we may suppose from the beginning (replacing 
B by a and by 2%) that for some 


atr 


To deduce from that the identity 2 =0 we have only to prove that 2» is ex- 
tensible. To this end take a chain x,, on G, bounded by VE%2%o. Then 
is a V-cycle on Ka and J2oz%, =z, q.e.d. 


§7. SIMPLICIAL SPECTRA 


28. Nerves. We denote by N, N., Ng,--+ and call merves the finite 
complete simplicial complexes (complete means every face of an element of NV 
is itself an element of N). The oriented simplices of N form a cell complex 


denoted by WN too. 
Let N., Ng be nerves, C., Cs closed subcomplexes of N., Ng, respectively, 


and 

Ka = N.\Ca, Ks = Na\Cz. 
Let of be a simplicial mapping of Ng into N. such that the image of Cs under 
of is contained in C,: 


(28) Cu 


Then in a well known way of generates a projection denoted also by of of 
the cell complex Ng into the cell complex N.; for any oriented simplex 
- - &) of Ng we put off - - - if the vertices - - -, 
are all different, and of=0 if for some i1+j, ofe8=o8, whereupon for 
any chain x3=) aify, o8x3=) a,0%%. The projection of of Ng into N. 
generates a projection of of Ks into K. by setting 

= J for any x3 € L; = Lx, 
(see section 20 where Ka, Ge, of, af are to be replaced by Na, Ka, 0, of, re- 
spectively). 


i 
i 
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A known fact is(*): 


LEMMA 28.1. Let of and be two simplicial mappings of Ng into both 
satisfying the condition (28) and the condition: 

(28.11) For every simplex there exists a simplex containing 
among its vertices all vertices of both o8Ts and o!®Ts; if in particular Ts€Cz 
then T., can be chosen among the simplices of Cua. 

Under these hypotheses the projections off and wl? of Ks into Ka generated by 
of and a! are homologous to each other. 


The proof is as follows. Let ¢:, é2, - - - , @, be all vertices of Ng in a definite 
order. Take a copy Ng of Ng with the vertices e/, ef, - - - , ef and consider 
all simplices of the form | ef, - where p=0, 1,---, 7 and 
is any simplex of Ns. The simplices just defined and their faces 
form a nerve Ng’ called the prism over Ns. Now define for each oriented 
simplex 4=(e;, - - - e:,) of Ng the prism over t as the chain 


p=r 
Ilég = > (- 1)" (ei, eee ei, Ci, eee €i,) 
p=0 


while the prism over the chain x=) ait GLY, is by definition the chain 
Ix =) a; 

on NZ’. An easy calculation shows: If 2 is a A-cycle on Kg then 

(28.12) Allzs = 23 


where zg is the copy of 23 on Ng and A operates on Nj’ at the left and on Ng 
at the right (see [4, p. 199]). ; 

Now map the vertices of Ng by of and those of Nf by off. By virtue of 
(28.11) we get in this way a simplicial mapping o,° of N{’ into N.. Denoting 
by the chain (on N.), by x, the chain which is on Cz. 
we get 


r+1 Br Br r 
An,%a = — — Xa 
and, K, being open on N,, 
N r+1 Nebr Na Br Br Br 
= = JK = 28 = Darg — Da 2g 


q.e.d. 

In the next chapter (section 31) we need the following 

COROLLARY 28.2. Let Aa, Ag be closed subcomplexes of Ka=Na\Ca, 
Ks=Na\Cz; let of, of map Cz into Ca as well as Ag into Aq. Suppose the con- 


(*) We write |e - - -e-| for non-oriented and - - for oriented simplices. 


1943] PROPERTIES OF COMPLEXES AND CLOSED SETS 313 


dition (28.11) satisfied; suppose moreover that for TsGAg the simplex T.. of the 
latter condition can always be chosen in A. Under these hypotheses not only the 
projections of and wl? of Ks into Ka generated by of, off are homologous but 
the same is true for the corresponding projections of}, wi? and oft, wi of Ag 
into A. and of Gz into Gz. 


To prove the assertion concerning As, Aa replace in 28.1 Ns, Na by 
AsUCs, AacU/C, and Ks, Ka by As, Aa. To prove the assertion concerning 
Gs, G. replace in 28.1 Cs, Ca by Cs\U Ag, Ca. and Kg, Ka by Gz, Ga. 

29. Simplicial spectra. Let an unbounded partially ordered set of nerves 
N., be given; we write 6 >a if in this set Ng follows on N,. Let (for each a) 
C. be a fixed closed subcomplex of N.. We suppose that for each pair B>a 
a finite number of simplicial mappings of (called projections) of Ng into N. 
is given and that the following conditions are satisfied: 

1. of maps Cs into C.; 

2. for each pair 8 >a the projections o satisfy the conditions (28.11); 

3. if y>B8>a and oj, of are projections, then the simplicial mapping 
of N, into is a projection. 

Under these hypotheses the partially ordered set of nerves N,, of their 
subcomplexes C, and of the projections of is called a simplicial spectrum 


(29.1) N = {Na, Ca on}. 


A simplicial spectrum (29.1) generates according to section 28 the cell spec- 
trum 


(29.2) K = {Ka, a}, 


where by K, is denoted the open subcomplex N.\C, as well as the cell com- 
plex of all oriented simplices of this open subcomplex; the projections af are 
(as defined in section 28): 
= for any xs Ls = Lx, 
REMARK. The complexes C, are called special subcomplexes of the spec- 
trum N; if they vanish the spectrum is called bicompact. 


CHAPTER III. SPACES 


§8. THE CELL SPECTRUM AND THE BETTI GROUPS 
OF A LOCALLY BICOMPACT SPACE 


30. The simplicial and the cell spectrum of a locally bicompact space. 
The nerve N is called the nerve of the finite system 


a= A,} 
of subsets A; (of the given set R) if the vertices ¢,, - - - , ¢, of N are in (1-1)- 


| 
| 
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correspondence with the elements A;,---, A, of the system @ and any 
Cig °° * » i, are vertices of a simplex of N if and only if the associate sets 
A;,, , Ai, have a non-vacuous intersection (°). 

Now let K be a locally bicompact Hausdorff space. We consider the set 
of all finite coverings, 


{oa1,*** 5 = Sa, 


of K by open sets 01, « - - , Qas. The nerve of a is denoted by Nz. We denote 
by C, the subcomplex of N, consisting of all simplices of N. whose vertices 
correspond to sets 041€a@ having a non-bicompact closure 64; in K. Thus the 
open subcomplex K.=N.\C,. is the subcomplex of all simplices of N. which 
have among their vertices at least one corresponding to a set 04:Ga with 
bicompact closure. The set of all N. is partially ordered: we set 8 >a if 6 is 
a refinement of @ (that is if each element of 6 is a subset of one or more ele- 
ments of a) while a is not a refinement of 8. 

Let B>a. We get by definition a projection of of Ng into N. if we 
assign to each vertex es; of Ng any vertex éa; of N. satisfying the con- 
dition that 0,;Co.;. It is easily seen that the nerves N, with their spe- 
cial subcomplexes C, and projections o% constitute a simplicial spectrum 
{ Na, Ca, 2} called the simplicial spectrum of the locally bicompact space K. 
According to section 29 this simplicial spectrum generates the cell spectrum 


(30) K = {Kz, 02} 
called the cell spectrum of the locally bicompact space K. The Betti groups 
Ak = lim inf 2); Vx = lim sup (Vxq, 


of the cell spectrum K are called Betti groups of the space K and are denoted 
correspondingly by Aj, Vi. 

If the space K is bicempact, then so is its spectrum K and the previous 
definitions are simplified by C,=0, Ka= Na. 

31. The figures K, A, T and K, A, G. Let the notations of the previous 
section hold. Let A be a closed set in the locally bicompact normal space K; 
the complement T'=K\A is open. 

Denote by A.CK, the closed subcomplex of all simplices 7,€ K, satisfy- 
ing the condition: the associate set 0aiCGa of each vertex ea; of T, has points in 
common with A. It is easy to see that the projections 08 of Ng into N, defined 
in the last section satisfy the conditions of the Corollary 28.2 and thus define 
in addition to the cell spectrum (30) the cell spectra 


(31.1) A = 


(®) Our terminology is correct, each nerve being, for example, the nerve of the system of 
the open stars of its own vertices, 
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and 
(31.2) G = 

Now Ax, V& are by definition identical with A, Vx while, as we shall see 
in the next paragraph, there exist certain natural isomorphisms between the 
groups 

Aa and AA; Va and VA, 


Ag and AT; Vo and Vr. 


These isomorphisms transform the homomorphisms Ef, E%, Jk, J& into the 
homomorphisms Ef, EX, Jk, Jk and thus lead to the definition of the groups 
of the figure K, A, I. 


§9. THE NATURAL ISOMORPHISMS, THE GROUPS AND THE 
HOMOMORPHISMS OF THE FIGURE K, A, I 


32. Special types of coverings. Throughout this chapter K means always 
a normal locally bicompact space; by a covering (without any special at- 
tribute) is always meant a finite covering by open sets (of the given space). 

In any covering a= { Oat, Oae} of K we denote by - , Oap and 
call elements of the first kind the elements meeting A; the remaining elements 
Oa,p+1) * * * » Oae have no points in common with A and are called elements of 
the second kind. An element of the second kind is called a boundary or an 
inner element according to whether its closure has or has not points in com- 
mon with A. 

A covering a= {001, Oap, ** Oas) Of K is called cogredient with A 
if it satisfies the following conditions: 

1. Any two among the point sets A(\o;, i Sp, are different. 

2. If to, ***, 1,Sp and Oaigl \ #0, then Al (\0ai, #0. 

3. The bicompactness of A(\o,;, 1 Sp, implies that of da:. 

Let a= { Oat, Oap be a covering of K. Denote by ¢ the 
sum of all those sets 5.; which are bicompact and lie in I’. The covering a 
is called cogredient with I, if it satisfies the following conditions: 

1. The elements of the second kind of a form a covering of T’ (denoted 
by Ta). 

2. No element of the first kind meets ¢. 

REMARK. In my paper [1] coverings cogredient with I’ were called “regu- 
lar with respect to I.” 

The covering a= { Oat, ** *»Oae} is called regular (“regular with 
respect to A” in the terminology of [1]) if the following conditions are ful- 
filled : 

1. The covering a contains no boundary element of the second kind. 
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To finish with these preliminary notions take in K, all simplices T, with 
the following property: Each vertex of T, corresponds either to an element 
0ai€a of the first kind or to a boundary element of the second kind; these 
simplices form a closed subcomplex A.’ of K, and for 8>a the complex Ag 
is mapped by any of into A,’ , the conditions of Corollary 28.2 being satisfied. 
Therefore we have the cell spectrum A’={AJ, o{} and the still more im- 


portant for us cell spectrum 

(32.1) G’ = {Gian}, 

where G/ = K.\A/, and are the projections of Af, Gf into AZ, GJ 
generated by 08. 

32.2. Obviously if @ is regular, then A,’ =Aa, 

The following results are proved in detail in my paper [1] (one can read 
the proofs without reading the rest of the paper): 

32.3. Any covering of K has a regular refinement [1, p. 80, Theorem 6.22]. 

32.4. Any covering of K has a refinement cogredient with T—and even 
more: 

32.5. Any two coverings a and 6 of K have such a common refinement ‘y 
cogredient with IT that every element of the second kind of y is contained 
in an element of the second kind of a and in an element of the second kind 
of B [1, p. 89, Lemma 8.5]. 

32.6. For every covering ar of I there is a covering a of K cogredient 
with I and such that ar = Te [1, pp. 87-88, Theorem 8.33]. 

Let us prove now: 

32.7. Every covering a= { °° 
cogredient with A. 

Proof. Let 


(32.71) AM) AM Og, qs >, 


be chosen among the Af\o,:, - - - , A(\oay in such a manner that no proper 
subsystem of the system (32.71) is a covering of A. Take such closed sets 
a;<(AN\o.:), i=1, 2, - ++, g, as to get a closed covering of A. Choose for 
each a; a neighborhood Oa;Co,; under the following conditions: 

1. All Oa; are different. 

2. Any Oa;,, - - - , Oa;, have points in common only if a;,, - - - , ai, have. 

3. The bicompactness of 4; implies that of Oa;. The sum of Oa, - - - , Oa, 
is a neighborhood OA of A. Take the closed sets },CK\OA so as to get a 
closed covering {b:, ---, bu} of K\OA in such a way that each of the b, 
is contained in some 0,;. Then define Ob, as 0a:/\(K\A), where 0a: is any ele- 
ment of a containing b,. The covering - - - , - - - , Ob. has the 
required properties. 

Let a, be a covering of A. We write a, = Aa, a being a covering of K, 
if the elements of a, are in (1-1)-correspondence with the elements of the 


-, Oas} of K has a refinement 
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first kind of a, each element of a, being the intersection of A with the corre- 
sponding element of a. From 32.7 follows: / 
32.8. If we consider among the coverings of A only those of the form 
a, = Aa, where a is a covering of K cogredient with A, and set for these cover- 
ings AG > Aa only if 8 >a we get a cofinal part of the set of all coverings of A. 
33. A lemma on group spectra. 


Lema 33. Let the group spectra: 
(inverse) {Uar, ok} and {U., wf}; (direct) { Ua, and { Uz, x3} 
satisfy the following conditions: 


1. There exists an isomorphism $2 


of onto U,; of Ua onto Us. 


2. Bud>adr implies B>a. 
3. For every 
E Uar Uar 
we have 
ard £8 B B Bu ad @ ar 
Under these hypotheses if we replace the elements of 

each thread tiax}; each bundle {uar} 


by their images under ¢% we get an isomorphism between 
lim inf (Ua, and lim inf (U2, 08); lim sup and lim sup (U2, 7). 


The proof is easy and may be found in [1, p. 62, Theorem 3.61 ]. 

34. The natural isomorphisms between Aj, Vi, and A, V4. We return to 
the figures of section 31 but we change slightly our notations: we denote now 
by @ any covering of A which can be written in the form a= Aad, where ad 
is a covering of K cogredient with A (and satisfying the condition Aa\ =a). 
We use Naa, Na for the nerves of ad and a, write of for the projections of 
Nz, into Nx, while 2 denote now only those projections of Ng into N. which 
are generated by a projection o&. As usual K.,, Ke is the subcomplex of 
Nar, Na respectively formed by all simplices which have among their vertices 
at least one corresponding to an element of ad (of a) with bicompact closure; 
Aa is the subcomplex of K., defined by the vertices which correspond to ele- 
ments of ad meeting A. 

The passage from ad to a= Aad produces an isomorphic mapping(®) of 


(®) Let N, N’ be nerves, Q, Q’ subcomplexes of N, N’ respectively. We define [Q] CN as 
the nerve consisting of all simplices of Q and of all faces of these simplices; the same for [Q’]. 
Any (1-1)-mapping of Q on Q’ generated by a (1-1)-simplicial mapping of [Q] on [Q’] is called 
isomorphic. 


} 
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A... onto K, and therefore an isomorphism ¢2 of the groups A’jo, Vino (of Aaa) 
on the groups Aj, V;, (of Ka). The isomorphism ¢% satisfies the hypotheses 
of Lemma 33 and thus generates a well defined isomorphism 


of lim inf (Ano, on lim inf (A, w.); 
of lim sup (Varo, sna) on lim sup (Ves ms). 
Since the coverings aA form a cofinal part of the set of all coverings of K while 
the coverings a form a cofinal part of the set of all coverings of A we may 
identify 
lim inf with lim sup (Varo, with Vi, 
lim inf with 44; lim sup (V2, 3) with V4 


and get in this way the natural isomorphism (A A) of 44, V4 on Ay, V4; the 
inverse isomorphism is denoted by (A A); we return to them in section 37. 
35. The natural isomorphism between Aj, Vp and AG, Vg. Now we de- 
note by @ any covering of I’, by ad an arbitrary covering of K cogredient 
with I’ and such that TaX=a. Then Naa, Na, Kaa, Ka have an obvious mean- 
ing. We retain among the inequalities Bu>ad only those for which there 
exists a projection o& which assigns to each element og,€fu of the second 
kind an element 02,GaA, 04,Gog,, of the second kind. Only these o%% will 
be taken into account. In this way we get by virtue of 32.5 a cofinal part 


(35.1) Ki= {Ka, wen} 


of the spectrum K. 
According to section 32 we denote by Gj, the open subcomplex of Kan, 


the simplices 7%, €G;, being defined by the following property: among the 
vertices of T., there is at least one corresponding to an inner element (of 
the second kind) of the covering ad. We denote by wi the projection of the 
cell complex G4, into the cell complex G, generated by o%. It is easily seen 
[1, p. 88, formula (8.41) ] that K.=G{, for every ad; thus we may identify 


the group 
A. = Ax, with = ve = VK. with Vien’ = Vary 


and therefore 


= lim inf with AG- = lim inf 


Vr = lim sup (Va, with Vo = lim sup 


But since the coverings aA (cogredient with I) on the one hand and the regu- . 
lar coverings on the other form both cofinal parts of the partially ordered set 
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of all coverings of K we get (taking into account 32.2) a natural isomorphism 
between 


Ag and Ag; Ver and Vo, 
and therefore between 
Ar and Ag; Vr and Vo. 
This isomorphism (considered from F to G) is denoted by (IT G), the inverse 
isomorphism by (G I). 
36. The groups of the figure K, A, f and the results concerning them. 


Because of the identities Ay =Ak, Vg =Vx the natural isomorphisms trans- 
form the homomorphisms 


(of the groups A4, VG, Vk, Ax into the groups Ax, Vk, V4, 4g) into the homo- 
morphisms 


of the groups 


Aa, =r, 


into the groups 
Ak, 
and thus define the groups 
Aka = = Ex(AA)AA = = = ERVE = Vc, 
= = = (Aa)Vax: ATK = = Grace, 
= OK = (AA)(Ex)' Ox = = (EK) OK = 
Va:K = VA — VaK = — (AA)Vax = (4A)Va:x; 


If we define the product of an element of A’ (of V'p) with an element of V4 
(of Ap) as the product of the corresponding elements of A4, V4 (of VG, AG), 
we see that not only all isomorphisms but also all dualities and annihilations 
between the groups of the figure K, A, G are transformed by the natural 
isomorphisms into the corresponding relations between the groups of the 
figure K, A, I. Therefore all results (14.1)—(14.3’) formulated in section 14 
for the figure K, A, G hold for K, A, T. 

37. Remarks on the natural isomorphisms and the groups of the figure 
K, A, IT. Let an element of the group V4, (of the group 44) be given. Take 
only the coverings ad of K cogredient with A. Using the subscript ad0 for 
V-cycles (A-classes) on Aa, we can write for any 


| 
| 
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= 2 = 


As we know A, can be identified with the nerve of a= Aad; therefore 2’, 2” 
can be considered as elements of V4, A and this identification represents an 
isomorphism—our natural isomorphism—(A A). 

For a moment, let us denote by coverings of I; let {2} 
Each % can be considered as a Y-cycle on Gi, thus (Gi, CG,, being open on 
as a V-cycle on G.,, where aA is cogredient with and a= Tak. Conserve 
among the coverings of K only those which are cogredient with T and among 
the projections o% of these coverings only projections which map elements 
of the second kind of Bu on elements of the second kind of aA. Then we can 
identify our 2” with a certain element of V@ and this identification is the 
natural isomorphism (I G). We needed the regular coverings only to prove 
that this isomorphism maps Vf onto V@. 

Now let @ be a covering of K. Let . 


8g = {Ba} Ac. 
Assign to any %,€3%, the cycle 


x Gar 
Z(a)' = 


(this is a customary method if we consider 2, as a Lefschetz relative cycle 
mod A, and keep in mind that A DA.). We get in this way an isomorphism 
of AG on AG (the “on” is again proved using the regular coverings). If now 
we take into account only the coverings of K cogredient with I we can iden- 
tify 2% = { %.} GAG with a certain element of Aj. Thus by the transition 
from 4% to AG and then to Ap we get the natural isomorphism (G I) of AG 


onto A’. 
To the simple intuitive meaning of the natural isomorphisms corresponds 


a direct definition of the groups of the figure K, A, I. Since Ayg=Aja, 
Ver=Vxo we are interested in the groups 
ATK: VT:k only. 

The elements of the bundle 2°€V% being identified with the YV-cycles zy 
on Aga, ad cogredient with A, the bundles 2*= {2,9} belonging to V'4p are 
characterized by the property: there is for each %,9€2" a Bu>ad such that 
is extensible over Kg,. 

The elements 2"CA, are characterized in the following way: we first 
identify the elements of the thread 2 with A-classes of Gj, (ad cogredient 
with I) and thus identify the thread 2 with a certain thread 2’ of the spec- 
trum G’; then using regular coverings we identify the thread 2’” with a thread 
of the spectrum G; this latter must be extensible in the sense of section 26. 

The elements of Ay. are threads 2A bounding on K; it means that 
if we consider 2,.€3,€2 CA's. as a A-cycle on Aa, (ad cogredient with A, 
Aad =a), this cycle bounds on Kay. 
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The elements of V/p.¢ are bundles 2°€V‘p bounding on K; it means that 
if we consider the elements of the bundle 2° as cycles on G., (ad cogredient 
with T), these cycles are elements of the bundle representing the zero of Vx. 
In other words, if %,€z’ and z"€V7jp, then there exists a Bu>ad such that 
bounds on Koy. 

As a matter of fact, the elements of the groups V'4:~=V4—V'ax can be 
defined as extension classes of V-cycles: let a, 8 be two coverings of A; define 
N., Ns, Ka, Kp as in section 34; two V-cycles 2%, 2 on K., Ks respectively 
belong by definition to the same extension class if there exist coverings ad, » Bu, 
yv cogredient with A such ‘that a=Aad, B= >ad, Bu and 

— 1°62, be a V-cycle on A», extensible over Ky». 

The elements of Aj.~ could be defined as extension classes of threads: 
two threads { 3%} and { 37} belong by definition to the 
same extension class if. each the cycle %—27 (where 37) 
considered as a cycle on Gi, (ad cogredient with T' and Tak=a) is extensible 
over Kay. 

38. Another form of the definition of the natural isomorphism (A A). 
This other form will be used in Chapter IV, remark 1 in section 43. We con- 
serve the notations of sections 30 and 31 and suppose for the sake of simplicity 
that K is bicompact. Take in each K, the closed subcomplex A?’ with the 
same vertices as A, and simplices | ei tae e:,| corresponding to those ele- 
ments 0;,, - - - , 0:, of the covering a of K which satisfy the condition 


Obviously Aj’ is a subcomplex of A, and 
(38.1) Ai’ =A. for a cogredient with A. 


Now denote by ad the coverings of K. To any { 
corresponds the covering 


a = Aad = {Ao;,---, Ao,} 


of A where the Ao;= A(\o; are “indexed sets” (in the sense of [1, p. 72, §5]), 
that is Ao; and Ao; are considered as different if 17. 

Each covering a of A can be put in the form a= Aad. The nerve of Aad 
being A) there is a natural isomorphism between and 


Va’ = lim sup = lim inf 


By virtue of the identity (38.1) this isomorphism turns into an isomorphism 
between V4, 44 and V4, 44 which is our natural isomorphism (A A). 
§10. THE CASE OF CONTINUOUS POLYHEDRONS K anp ACK 


39. An invariance theorem. Let K be a finite polyhedron, K a simplicial 
decomposition (Simplizialzerlegung) of K (terminology of [4, pp. 128-129]). 


| 
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Let A be a closed subcomplex of the complex K; then A denotes the poly- 
hedron composed by the simplices of A; as usual G=K\A, T=K\A. Let us 
show that the groups of the figure K, A, T are isomorphic to those of the figure 
K, A, G. To this end denote by 


Ki, Kn, 


the successive barycentric subdivisions of K; then the mth barycentric sub- 
division A, of A is a subcomplex of K,. Denote for a moment by Xj the 
covering of K by the closed barycentric stars of the vertices of K, and take 
the open neighborhoods of the elements of \, so closely as to get an open 
covering X, of K similar to A, (that is, having the same nerve as )4) and 
cogredient with A. Put a,=AX,. Obviously the A, and the a, form cofinal 
parts of the sets of all coverings of K and of A respectively. The nerve of An 
being K,, and A, being the nerve of a,, the projections of K,4; into K, and 
of A,4, into A, map any vertex én4: of Kx4: (of An41) on a vertex of the carrier 
of én4: in K, (in A,) and thus are “natural displacements” (“natiirliche Ver- 
schiebungen” in the sense of [4, p. 349]). Therefore by a well known standard 
process the elements of Ay, A’ are in (1-1)-correspondence with the elements 
of Ax, 44, this correspondence realizing an isomorphism between 


Ay and dx, and di, 


Ara and Aime, ALK and 


All other groups of the figures K, A, and K, A, G can be derived respec- 
tively from Ay, Axa, and Ak, Axa, 44:x by the same algebraic construc- 
tions; thus the groups of K, A, I are respectively isomorphic to the groups 
of K, A, G. 

40. The general polyhedral case. Let K and ACK be continuous poly- 
hedrons (that is, topological images of finite polyhedrons). In this section 
coefficieut-groups are the so called “elementary groups,” that is discrete groups 
with a finite number of generators and their (bicompact) character groups. 
Any subgroup and any difference group of an elementary group being elemen- 
tary, it follows that the groups Ay, Vx, 44, Va, Ska Ver Sax Varn 
as well as (by the third pair of isomorphisms) the groups Af.x, VP. forr21 
are elementary. Now if—for a discrete group X—a subgroup U and the dif- 
ference group X — U are both elementary, then X itself is elementary (the 
proof is obvious, see for instance [4, p. 576, section 38]). By this lemma and 
the first pair of isomorphisms the groups Af, Vp for r21 are elementary. 
If X is the coefficient-group then, as is easily seen, V(X) is the direct sum 
of as many groups isomorphic to the group X as there are compact compo- 
nents of I’. As in our case the number of these components is finite, V'p is 


elementary. 
40.1. Thus if K and A are continuous polyhedrons, then with respect to 
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any elementary coefficient-group all groups of the figure K, A, I are elemen- 
tary; they are all finite if the coefficient-group is finite. 

For a finite coefficient-group X the order of the group VE is equal (by the 
second pair of isomorphisms) to the product of the orders of Vgyr and Vax. 
This holds in particular for X =I, whatever the integer m22. Since by a 
theorem of M. Bockstein [6] all groups Vy(Z) are completely determined by 
the orders of the groups Vx(m) taken for all m and all 1, it follows that all 
Vx are determined by Vigr(m) and V'4x,(m) (taken for all m and all r). In the 
same way by the first pair of isomorphisms the groups A’, are determined by 
4’y:K(m) and Aj, (m) taken for all m and all r. To get an analogous result for 
the groups Vp, Vr, Vr we need the following 


Lemma. The groups V'p(m) (even the groups V'p(X) for any X) can be deduced 
from the groups Vp(I) (taken for all r) by means of the same relations as in the 
case where T is a finite cell complex. 


Let us assume for a moment that this lemma is proved. Then as the Bock- 
stein theorem for a complex K rests solely upon the algebraic relations between 
Vx(J) and Vx(m), its original proof still holds for Tf’. Thus our lemma im- 
plies the following: 


BOCKSTEIN THEOREM FOR I. The groups vr) are completely determined by 
the orders of the groups Vy(m) taken for all m and all r. 


Now by the first pair of isomorphisms the order of Vp(m) is equal to the 
product of the orders of Vip-.x(m) and and therefore V'p(J) is deter- 
mined by the groups and Vigr(m). 

It remains only to prove the lemma. A proof of it is contained in the con- 
struction of Steenrod [13, section 11] which in our case gives the desired 
expression. But one can proceed more quickly by the following argument of 
Bockstein which uses only the final result of Steenrod and not his intermediate 
construction. The groups Aj(J) being groups with a finite number of genera- 
tors, they are respectively isomorphic to the groups Vy(J) where N is an ap- 
propriate nerve (see for example [4, p. 266, section 9]). Let N be the 
corresponding polyhedron. By the fundamental theorem of Steenrod the 
groups Vy(m) can be expressed through the groups Vn(J) and these expres- 
sions depend only on the groups involved (and the given coefficient-group, 
in our case J,,) and are therefore the same for I and N. On the other hand, 
the groups Vn(J), Vn(m) are isomorphic to Vy(J), Vy(m) and the expression 
of Vxn(m) through the groups Vx(J) is the same as the expression of Viy(m) 
through the groups Vy(J), which proves the lemma. 

Thus all results announced in section 14 are now completely proved. Let 
us remark that the theorems of section 18 could be extended by the same 
methods first to spectra and then to spaces; such an extension does not pre- 
sent any serious difficulty and may be left to the reader. 


| 
| 
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CHAPTER IV. MANIFOLDS 
§11. INFINITE COMPLEXES 


41. Infinite complexes(’). Let Q be an infinite locally finite, complete (sec- 
tion 28), simplicial complex, thus a simplicial decomposition of a locally bi- 
compact space (of an infinite polyhedron) Q. Denote by LQ the group of the 
r-dimensional finite chains on Q with respect to the discrete coefficient-group 
X, by TQ the topologized (as in [13, p. 691]) group of all (infinite) r-dimen-: 
sional chains on Q with |respect to the bicompact coefficient-group =| xX. 
The topology in Z is that which makes of ZQ the direct sum of the groups Jj, 
the group Lj = & being the group of all chains of the form a4, where & is an 
oriented simplex of Q and 4 runs over &. Since Lj is the direct sum of the dis- 
crete groups Lj~X (the elements of Lj being the chains a4 with a running 
over X), the groups LQ and TQ are dual to each other. With these groups we 
get in the usual way the groups AQ and VQ: the group VQ (the group AQ) is 
the difference group of the group of all r-dimensional finite V-cycles (infinite 
A-cycles) over the subgroup of the bounding ones (a bounding Y-cycle bounds 
by definition a finite chain). 

The purpose of this section is the proof of the following theorem: 


41.1. If Q is a simplicial decomposition of the infinite polyhedron Q, then 


(41.1) Ao ~ Va: 


By virtue of the dualities Ag| Vo, 4Q| VQ it is sufficient to prove the first 
of the isomorphisms (41.1). 

PRELIMINARY REMARKS. Let a be a (finite open) covering of Q; an element 
of a (as well as the corresponding vertex of the nerve N,) is called regular 
if its closure is bicompact; the other elements of @ (and vertices of N.) are 
called special. The simplices of N. whose vertices are all special form the 
special subcomplex C, of N.; the open subcomplex K,=N.\C, is called the 
regular part of N.; all regular vertices of N., and only the regular vertices, 
are zero-dimensional elements of Ka. 

The open finite subcomplex G of Q is called a regular subcomplex of Q 
if it is the sum of the open stars(*) of some of the vertices of Q, or (which is 
the same) if each simplex of G has at least one vertex which belongs to G as 
an element. The proof of the following statement is obvious: If G is any open 
finite subcomplex of Q, while Q,, G; are barycentric subdivisions of Q, G re- 
spectively, then G; is a regular subcomplex of Qh. 

Our last auxiliary definition in this section is the following: A covering 
a of Q is called combinatorial with respect to Q if the following conditions are 
satisfied: 


(7) This section will not be needed in the rest of this paper. 
(*) We use the word “star” for complexes as well as for the corresponding point sets. 
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(a) Each regular element of a is the open star of some vertex of Q. 

(b) Let G, be the (regular) subcomplex of Q defined as the sum of all open 
stars which are elements of a; let T. be the corresponding point set in Q 
and T its closure (in Q). Any special element of @ either has no points in 
common with F or is of the form 0/,\UO;, where O, is on Q\T and 0, is the 
open star (with respect to Q) of a vertex of some simplex of Gz. 


Lemma 41.2. The regular part of the nerve N. of any covering a combina- 
torial with respect to Q is a regular subcomplex of Q and any regular subcomplex 
G of Q is the regular part of the nerve of some covering combinatorial with respect 
to Q. 

The first statement of Lemma 41.2 follows immediately from the defini- 
tion of a covering combinatorial with respect to Q. Let us prove the second 
statement. Let e; be a vertex of an element of G. If e; ©G denote by o; the open 
star of ¢;; if e; is not an element of G put 0;=0/ U(Q\I), where o/ is the open 
star of e; in Q. The open sets 0; form the required covering. 


Lema 41.3. Each covering a of Q has a refinement which is combinatorial 
with respect to a certain subdivision Q) of Q. 


Proof. Let 01, - , 0p be the regular and - , 0, the special elements 
of a. Take the closed sets a;Co; in such a way as to get a closed covering 
{a:i,---, of Q. The a; with i<p are bicompact while the ap4:, - @s 
may be supposed to be not bicompact. Choose the subdivision Q, of Q such 
that any star of Q, meeting a; is on 0;. Now let ou, ---, Or. be all stars 
of vertices of Q, which meet any one of the sets ai, - - - , @p. These stars form 
a regular subcomplex Gy of Q,; denote by I’, the corresponding subset of Q. 
Denote by O;, h=p+1, - - -, s, the open sets composed by all stars of Q) 
which have points in common with a,. Among all zero-dimensional elements 
of Q,\G, denote by e/, - - - , e¢ those which are vertices of elements of G). 
Define for i=1, 2,---,q 


where 0, is the star of e/ in Q, and h;=p+1 is chosen under the condition 


that a,, has points in common with 0,,. By the definition of O, we have 


thus 0,«4:Con, If in the latter construction O,41, - - - , are the only 
sets O, actually used, we put, for 7=q’+1,---,s—), 


ute+i = 


The open sets 0;,7=1, 2, - - - , u-+g+s—p, form the required covering. 
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Now we go over to the proof of the isomorphism Ag~Ag. Retain only 
those coverings of Q which are combinatorial with respect to subdivisions 
of Q. If a, 8 are combinatorial with respect to Q), Q, then we set 8 >a if both 
of the following conditions are satisfied : 

1. B is a refinement of a, @ is not a refinement of 8. 

2. Q, is a subdivision of Q). 

The coverings with this ordering form a cofinal part of the set of all cover- 
ings of Q. The subcomplexes K.=N.\C, being open (even regular) subcom- 
plexes of Q,, they undergo projections generated by natural displacements 
(section 39) of Q, into Q,. Let af be those projections. Then 


(41.4) Ag = lim inf 


Take on the other hand all finite open subcomplexes G,, of all subdivisions 
Q) of Q. We say that Gz, follows on G,, and write Bu >ad if Q, is a subdivision 
of Q, and the subdivision ssG., of Ga, in Q, is a subcomplex of G,,. In particu- 
lar, if G, are the subcomplexes of Q itself, then 8 >a means that G.CG,. 
Denote by of the natural displacements of Q, into Q, and write (as indices) 
(Bu)A instead of ofG,, and ad instead of G,,. Since Ga, is an open subcomplex 
of ofG,, we may define a projection p& of the cell complex G,, into Ga, by 


means of 


Bu (Bu)r 
PadrXpy = Jar 


for any chain x,, on This gives for the relation and 


in particular p£x,= on Q. 

Those G,, which are subdivisions of appropriate G.CQ form a cofinal 
part of the set of all G,,, and p2 maps isomorphically A‘, on Al. Therefore 
lim inf (Af,, p&) ~lim inf (4%, J®). But the groups lim inf (4%, J8) and AQ 
are isomorphic a. shown by Steenrod [13, pp. 691-692]; thus 


On the other hand, since the regular subcomplexes form a cofinal part in 
the set of all G.,, we deduce from Lemma 41.2 the isomorphism 


(41.6) Ao ~ lim inf (Aza, pan): 


From (41.5) and (41.6) follows Ag =~ Ag, q.e.d. 

42. The groups 49 and the duality theorem of Alexander-Pontrjagin. Let 
X be a discrete coefficient-group. Denote by 69 the difference group of the 
group of all r-dimensional finite A-cycles of the (infinite) complex Q over the 
subgroup of the bounding cycles (a bounding cycle bounds by definition a 
finite chain). In this section we suppose that Q is an n-dimensional combina- 
torial (4-)manifold (finite or infinite), that is, that for the open star O; of 
any vertex e; of Q 


=z 
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Ao(D) Vo) I, 


while for r<mn the groups AG,(J) ~Vo,(J) contain the zero element only. We 
suppose moreover that Q is orientable, which means that for every component 
Q: of Q, AQ(D) ~VG,(1) =I. The oriented (open) barycentric stars of Q 
form a cell complex Q*. If # is an oriented simplex of Q, then we denote by 
ti, with g=n—p, the barycentric star with the center in the center of @ ori- 
ented in such a way that the intersection number (4 X71) be equal to +1 
(see for example [12, §69]). Then we have in the cell complex Q* the incidence 
numbers 


(rf = (— 287 


Define now for any p-chain x? on Q the g-chain D%? on Q* taking on 7f the 
same value as x” takes on #. An easy calculation shows that 


(42.1) AD*tx? = (— xP 


whence it follows that the operator D* generates an isomorphism between 
VQ and 8%. As both 4 and 4% are isomorphic to 6%,, where Q; is the bary- 
centric subdivision of Q, we have the 


DUALITY THEOREM OF POINCARE. 


Vo 89, q=n-— 


Let Q be an n-dimensional continuous manifold, that is, a finite or infinite 
polyhedron, one and thus all simplicial decompositions of which are combina- 
torial manifolds. Then any open set on Q is also a continuous manifold (see 
[4, pp. 143-146, Theorem “of Runge” ]). There follows then from the duality 
theorems of Kolmogoroff and of Poincaré the 


DUALITY THEOREM OF ALEXANDER-PONTRJAGIN. If the n-dimensional ori- 
entable continuous mantfold Q is simply connected in the dimensions p and p+1, 
A is a closed set on Q, and T=Q\A, then the groups A and 5¢" are dual (the 
groups Vi, and oF ’ are isomorphic) to each other. Here the coefficient-group for 
df’ and V4 is the discrete group X while the coefficient-group for A% is the bi- 
compact group =| X. 


The generalization of this theorem for an arbitrary orientable manifold 
(not necessarily simply connected in any dimension) is given by the 


GENERAL ALEXANDER-PONTRJAGIN DUALITY THEOREM. For any n-dimen- 
stonal orientable continuous manifold K, any closed set ACK, and T=K\A, 
the group V4.4 1s isomorphic to the group OS .%, where the latter group is defined 
as the difference group of the r-dimensional finite cycles on T over the subgroup 
of the cycles bounding on T (coefficient-group discrete). 


| 

| 
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Here the finite cycles on I are to be understood in some invariant sense 
(as “continuous cycles” in the sense of [4, pp. 333-339] or as convergent 
cycles on compact sets ¢CT (section 43)) or to be replaced by the 6-classes 
of section 43. 

To prove this theorem we have to show only that the groups V’p._% and 
dp. are isomorphic. This can be done by the standard methods of proving 
invariance theorems (see for example [4, Chapter IX] and [3]); these meth- 
ods, however, so far as the writer sees, need at a certain moment the Combina- 
torial Lemma 45.3. But the same lemma is the only essential point in proving 
directly the isomorphism V4.4 ~ 5f:%- We prefer therefore to give this direct 
proof which rests entirely on section 15 (Chapter I), being practically 
independent of Chapters II and III. 


§12. THE Groups V'4:K, AND FOR A MANIFOLD K 


43. The groups A’.;, and V4._ for a polyhedron K. Let K be a polyhedron; 
we suppose this polyhedron finite (that is bicompact) although the following 
definitions hold for infinite polyhedrons too. As usual A is any closed set in K 
and '=K\A. In this case the groups A4y:x, V’4:~ can be defined in a quite 
elementary way, independent of the definitions and results of Chapters II 
and III. Let 


(43.1) Ki, , 


be a sequence of successive subdivisions of a given simplicial subdivision K 
of K; we suppose moreover that for any integer a21 the complex Kay, is the 
result of one or more successive barycentric subdivisions of K,. Taking in K. 
all simplices which meet A and all faces of these simplices we get a closed 
subcomplex Aq of K,. Put in correspondence to any vertex @a41 Of Kay a 
fixed vertex ¢.=0%*"e,4; of the carrier of é24: in Ka. This gives a well defined 
simplicial mapping (a “natural displacement,” section 39) called a projection 
of onto K,. The projection maps Aas: into For B>a+1 put 
8 a+1 a+2 B 
Ca = Ca Fati*** 

which implies the transitivity relation 02 =0403 for any y>8>a. 

The projection of generates a homomorphism of of L3= kg into L,=Lk, 
and a homomorphism of of Lj,=L4 into Lyo=L4,; the conjugate homo- 
morphisms and of the group into Lj and of Li, into Lj are defined 
by 


r Br 
(43 tg) (Xa* 


for any chain x, on K, (on A.) and any oriented simplex 4 of Ks (of Ag). 
The homomorphisms of, 7% generate homomorphisms (denoted also by 
af}, mo) Of Ajo into Ate (of Vio into Vg) and map into 
=4A4,:K,3 into Vios=V4,:x,- We define now: 
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AA‘K = lim inf (Sane @eo); = lim sup 


REMARK 1. It is easily seen that the groups A4:x, V4: just defined do not 
depend upon the special choice of the sequence (43.1). But one can also show 
that the definitions of this section agree with those of Chapter III. In fact, 
the simplices of A, are characterized among the simplices of K, by the prop- 
erty that their open stars meet A; therefore A, is the nerve of the covering 


Gust 


of A, where 0,;= A(\O,; and the O,; are those open stars of (the vertices of) 
K, which meet A. In other words, if ad is the covering of K by the open stars 
of the vertices of K, then in the sense of section 38 we have a= Aad. The 
coverings aA, a=1, 2, ---, form a confinal part of the set of all coverings 
of K and the projections 0% put in correspondence to each O,; an Oai2Ov;. If 
we take into account section 38 we see at once that the definition (43.2) gives 
us the same groups as the definitions of Chapter ITI. 
REMARK 2. The elements of A’4y:~% are those “threads” or sequences 


r Ff r r r r BO r r 
= (31 Aa, ta © Aco, = 


whose elements 3, CA‘, are classes of cycles on A, bounding on K,. 

It is easy to see that the group A4:, can also be defined as the difference 
group of the group of all r-dimensional convergent cycles (see for instance 
[5, p. 241]) of the set A over the subgroup of cycles bounding on K. 

The elements of V4.4 are extension classes of r-dimensional V-cycles; the 
V-cycle z, on A, and the V-cycle % on A, belong by definition to the same, 
extension class if there exists a y>a, 6 such that 1%z,—7%25 is extensible 
over K,. 

REMARK 3. The coefficient-groups in A’y.%, V'4:~ are dual: bicompact for 
and discrete for 

44. The groups 6)., for a manifold K. From now to the end of this paper 
K is a continuous n-dimensional manifold (definition in section 42) which 
for sake of simplicity we suppose closed (that is bicompact) although all 
following definitions and results hold without this restriction. We conserve 
the notations of section 43 and denote by K.: the barycentric subdivision of 
K, (thus K.4: either coincides with K.; or is a subdivision of K.1). The com- 
plex of all (open) barycentric stars of K, is denoted by K.2*. The barycentric 
stars which correspond to simplices of Gz=K.\Aa form a closed subcomplex 
G of K.*. Since the elements of K.* are subcomplexes of Kai, we get a closed 
subcomplex Gx, of Ka1, called the barycentric subdivision of G.*, if we take all 
simplices of K.: lying on elements 7 of G* (that is all simplices which are 
elements of the subcomplexes T7.*C€G,*). The polyhedron composed of all 
simplices of is denoted by I,. It is CT and even Without 
loss of generality we may suppose that I'41CT'(a411. Therefore the subdivision 
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of Gj, into simplices of Km, B2a, is a subcomplex s$iGz, of Gj,, and for any 
chain on Gai we have the subdivision on 
REMARK 1. By definition 


alr r r r 
= (%a1°ter) 


if the simplex 4, of Gj, lies on &, and its orientation is coherent with the ori- 
entation of &,; (sgixi,-f,) =0 if 4, lies on a higher dimensional element of 

REMARK 2. In an analogous way we define the subdivision s§,x, of a chain 

x, on Obviously for B>a 
= 

After these preliminary remarks we define the group dfp._ as follows. We 
say that two A-cycles, the cycle z,, on Gj, and the cycle 2, on Gj,, belong to 
the same r-dimensional 5-class on T if there is a y >a, B such that s*iz,, — Sigh, 
bounds on G?,. To get the sum of two 5-classes 2” and 2’* take ad libitum 
E32", and define 2’+2’" as the 5-class containing s%}z,, where 
7 is any integer which is greater than or equal to the greatest among the in- 
tegers a and B. 


DEFINITION 44.1. The discrete additive group of all r-dimensional 5-classes 
on T is called the group Sy (with respect to the given discrete coeffictent-group). 


DEFINITION 44.2. The subgroup Sp.% of Sp is formed by the 5-classes 2° 5p 
whose elements 2,, bound on (the corresponding) Kay. 


REMARK 3. The groups dp, dp.~ could be defined also as follows. A con- 
vergent cycle on a compact subset of I’ (with respect to a discrete coefficient- 
group X) is called a cycle on I; by definition, it bounds on I if it bounds on 
a compact subset of I’. The difference group of the group of all r-dimensional 
cycles on I over the subgroup of the cycles bounding on I is the group dy; 
the subgroup of 4 whose elements are classes of cycles bounding on K is the 
group 5p.x; the latter group can also be defined as the difference group of the 
group of all r-dimensional cycles on I over the subgroup of the cycles bound- 
ing on K. 


§13. THE GENERAL ALEXANDER-PONTRJAGIN DUALITY THEOREM: 
REDUCTION TO THE COMBINATORIAL LEMMA 


45. Formulation of the theorem and of the lemma. From now on we either 
suppose the manifold K orientable or take for the coefficient-group the group 
I». All notations of §12 hold. 

The theorem to be proved is: 


45.1. The groups and (q=n—p) are dual; 
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or (equivalent formulation) : 
45.2. The groups and (g=n—p) are isomorphic. 
The proof rests upon the following 


COMBINATORIAL LEMMA 45.3. If 22 is a V-cycle on K. equal to zero on Aq 
then for any B>a we have 
(45.3) © on 
(co ts the sign of homology, section 9, D* is the duality operator of section 42). 

This lemma will be proved in the next paragraph; in the present paragraph 
we suppose the lemma proved and prove on this basis the Theorem 45.2. 
We write E, insteatl of Ex. 

46. The operators AD‘E, and the isomorphism AD‘E. To each V-cycle 
2, on A, corresponds the Y-cycle VE.2%% on G, and thus the A-cycle 
Dt" ADE, 22, on G* bounding on K.*. If a Y-cycle 22 on 
bounds on G, the chain x2~', then bounds on the chain + 
and vice versa. Therefore from section 15 we have: 


46.1. The operator AD{E, produces an isomorphism of the group Vi,:k. 
onto the group Ag--.x,+. 


Our purpose is to show that the operators AD‘E, taken for different a 
produce an isomorphism—we call it AD*E—of V%4., onto df. We need for 
this proof some minor lemmas. 


LemMA 46.2. For any V-cycle 22, on A, and B>a 
Proof. As 73 commutes with V we have to prove the homology 


Vrs on Gz 


— © 0 on Gp. 


But an easy direct calculation shows that the chain 1§E.2% — Egngo%o is itself 
on Gs, which proves the assertion. 
The fundamental lemma follows: 


Lemma 46.3. For any V-cycle 22 on A, and B>a 


Proof. Set in (45.3) 


= 
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instead of 22; it becomes 
Apply to both sides of (46.2) the operator D§~’; this gives 

Thus on the left-hand side of (46.31) we can replace D§-'x§VE.2% by 
and get 
But by (42.1) 

Ds = (— 
{ Ds VEatao = (— ADSE 


If we bring this into (46.32) we get (46.3). 
It is now easy to prove that the homomorphisms AD{E, taken for differ- 


ent a generate a homomorphism AD‘E of into since every AD{E 
bounds on K.* we need only to verify the following 


Lemma 46.4. If for the V-cycles 22 on Aq and Zp on Ag there exists a y>a, B 
such that is extensible over K,, then | 


Proof. From 46.1 it follows that AD!E,(x%22o—7%2%) bounds on G,*, 
thus 


(46.41) © 290 On 
On the other hand (46.3) gives (with 8 replaced by 7) 


and (with a, 8 replaced by 8, y respectively) 


These homologies substituted into (46.41) give (46.4). 
46.5. The homomorphism AD‘E of V4:% into 5f:% is an isomorphism. 
This follows from 


LemMA 46.51. If we have for a V-cycle 2% on Aq and a certain B>a the 
homology 


© 0 on Gor, 


then goto is extensible over Ks. 
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Proof. By (46.3) 


thus sf, AD{Egnr3o22%) bounds on Gp;. It is known (see for instance [4, p. 247, 
Theorem III]): If a cycle of the form s§,2*, where z/ is a A-cycle on G#, 
bounds on then zs* bounds on G*. Therefore bounds on G#* 
and by 46.1 the V-cycle 19225 is extensible. 

46.6. The isomorphism AD‘E maps V4.% onto df... Let z¢-' be an element 
of 5f.j. It is known (see for instance [4, p. 246, Theorem III]): Any A-cycle 
on G;, is homologous on G3, to a cycle of the form s%,2* with 2* on G,*. It 
follows that the 5-class z*-' contains a cycle s%,z.*, where z.* is a A-cycle on 
G+ and bounds on K.*. From 46.1 it follows that there exists a V-cycle 2% 
on A, such that AD{E,22% is homologous to z.* on G,* and therefore 


su 
which proves our assertion. 


§14. PRooF OF THE COMBINATORIAL LEMMA 


47. The fundamental identity. We write in this paragraph T for non- 
oriented and ¢ for oriented simplices. The simplex 77, of Kai whose vertices 
are the centers of the simplices T7*> - - - >7;" will be denoted by 


Ta=|To >To. 
A simplex Ty, of Ka: is called principal if its dimension is the same as the 
dimension of its carrier in K.*; the other simplices of K.1 are called accessory. 


Among all simplices of K.: the principal simplices and only these are of the 
form 


If we put in correspondence to the center of each simplex 7, of Kg the center 
of the simplex oT, we get a simplicial mapping of! of Kg into Ka: for 
Ty =|T;'>Ts'> -- - CKm we get 

Remark 1. If of does not degenerate on Tz (that is, if the dimensions of 
o£T and 7 are the same) then for any simplex 7}, lying on 73 the simplex 
ofi Th, is the image of Tj, under the affine transformation of of T;. 

The manifold K being orientable, choose any definite orientation of K. 

-and the corresponding orientation of Kg as well as of all 2-dimensional sim- 
plices TZ and 73 of K. and Kg. These orientations will be denoted by &%, &. 
For p< take ad libitum a definite orientation of each p-dimensional simplex 
T2€K, and denote this orientation by #. Choose the orientation 7% of the 
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corresponding barycentric star TZ in such a way as to get the intersection 
number For take the orientation #, corresponding 
to 
Choose as follows the orientations # of the simplices T} of Ks. If of does 
not degenerate on 7}, and of73=T72, define % by the condition off=2&. 
If of degenerates on 7}, choose # arbitrarily. The orientation 7$ of the bary- 
centric star Tj corresponding to 7} is again defined by the condition 
(4X73) = +1. Finally, the orientation 4, of corresponds to 75. 

The proof of the combinatorial lemma 45.3 rests on the following 


FUNDAMENTAL IDENTITY. 
(47. 1) Xa. = SaiDaXea 
for any chain x2 on K, and any B>a. 


REMARK 2. The mapping of (of Ks on K.), being a natural displacement, 
has the degree 1 (see [4, p. 348, Theorem II]). If of is an arbitrary simplicial 
mapping of the degree d of the arbitrary n-dimensional closed orientable 
manifold on Kg on the arbitrary n-dimensional closed manifold K., then 
(47.1) is replaced by 
(47.2) : xe = 
for any chain x2 on Kq where the operator 7 is defined as usual by (1x2 - 4) 
= (x? -08#) for any oriented 4€ Kg. In this general form the identity (47.2) 
formulates a known property of the Hopf “Umkehrhomomorphismus” (see 
for instance [7]). The proof given below involves this general case. 

48. Proof of the fundamental identity. Let us see how the principal sim- 
plices are transformed under o4}. First of all: 


48.1. If the accessory q-dimensional simplex 
Ti >--->T."| Kas 
is the image under of the principal simplex --- >T7}|, 
then n,<p. 

Proof. From oT, ‘= Tx‘ follows n;Sn—i, in particular mp Sn, n,Sp. If 
n,=p, the simplex 7%, would be principal, and 48.1 is proved. 

The principal simplices Tj, mapped under of} on the principal 
T,=|T2> tee >T?| are easily seen to be in (1-1)-correspondence with 
the Ks mapped under of on T2. Moreover the orientations %, 2, 
&1, 4 have been chosen in such a way that from 7}, = | Tz>--- >T?| and 


(48.11) oly = he (with e = +1) 


follows 
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(48.12) = 


with the same ¢ in (48.11) and (48.12) (the simple proof uses remark 1 of 
section 47). Therefore for any principal simplex #, 


B B n 
where the sum is taken over all principal 4, at the left and over all -dimen- 


sional simplices of Kg at the right. Since the value of 02) °% on & is equal to 
the degree d we get 


(48.2) on doin 


(summation on both sides over all g-dimensional principal simplices of Kg 
and respectively). 

The proof of the fundamental identity presents no more difficulty. To 
simplify the notations put 


8B a 
yar = xa. 
We shall calculate the value of the chain yg, on an arbitrary simplex 4, of Ka. 
If #1 is accessory this value is zero by the very definition of the operator s};. 


If %, is a principal simplex, let 7j be its carrier in Ks* and @ the corresponding 
(oriented) simplex of Ks. By the definition of s§,, Df and 73 we have 


(48.3) = (x2-oet3). 


To evaluate the left-hand side of the identity (47.2) take any 73,C Ka 
and all principal simplices 


which are mapped by of! on T?,. Let 
= € ilar, 
We have by (48.3) and the definition of the ¢;: 
i i 
If T%, ts accessory then by (48.1) we have n,<p; it follows—since 
T3(j) = T2-—that (x2 =0, therefore by (48.4) 
= 0. 


Obviously (s%,D£x2 - #2,) =0, thus both sides of (47.2) take on the accessory “4; 


the value zero. 
If Ti, is principal, Ti,=|TzZ> --- >T2|, then o£7}(j)=T2, and even 
o5t3(j) =2, hence by (48.4) 
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= = (2-2) Dg. 
i i 


By (48.2) >-je;=d and therefore 
= d(xa-te). 


So much for the left-hand side of (47.2). At the right, the chain s%,D£x2 takes 
the value (x2-#) on é,, which completes the proof of the identity (47.2). 

49. The rest of the proof of Lemma (45.3). This is based on the following 
two elementary propositions: 

49.1. Let I’., P'%, denote the set theoretical sum of the simplices of G., Gi, 
respectively. There exists a continuous mapping C of the point set I’, onto 
the point set I'%, leaving all points of I'%, invariant. 

49.2. Let the V-cycle 2 on Ka be equal to zero on Aq; if 2, =s,D§ugz2 
differs from zero on a given 73, © Km then a convex subset of I’. contains both 
the closures of 7}, and of of{T}.. 

Proof of 49.1. The open set I, is the set theoretical sum of the simplices 
Tai€ Kai lying on it. Each of the simplices T.1€ Kai lying on I, but not be- 
longing to G%, is of the form 

|, 
where 7%*,---, Tx* belong to G., while Ty are elements 
of A,. Thus each point a of TZ, is on a segment |a’a’’| 7%, joining a 
point a’ of the simplex Tz, with a point of 
= | the simplex 7%; belonging to the barycentric 
subdivision Aa of Ae. If we put C(a)=a’ for aET.\T%, and C(a) =a for 
a€\I%, we obtain a continuous mapping C with the required properties. 

Proof of 49.2. Let 7* be a simplex, 7? a face of 7”. Denote by S(T”, T*) 
the complex composed of 7” and all faces JT’ of 7” satisfying the condition 
T’=T» (where T’>T7”’ means that the simplex T’’ is a face of T’). Denote 
by >-(T?, T*) the set theoretical sum of all elements of S(T”, 7"). Among all 
points of the closed set 7* the points of }-(7®, 7) are characterized by the 
property that their barycentric coordinates (in the coordinate system defined 
by the vertices of 7") corresponding to the vertices of T? are all positive. 
Therefore the point set )-(7?, T*) is convex.. This obvious remark will be 
applied now to the proof of 49.2. The hypotheses of 49.2 imply that Tj, is 
a principal simplex, 

=|T;>--- > 
and that 7322 differs from zero on 73; hence 


Denote by Tz the carrier of Tj in K., and put 
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T. = 


The mapping of being a natural displacement, we have 


and therefore 
Since T2€G, and G, is open in Kz, we have 
S(T%, Te) S(Te, Te) Ge. 
The vertices of o4}7},, being the centers of the simplices 
= 0413 2 +++ = Te, 


are points of the set 72%). Since the point set TZ) 
is convex and contains the vertices of the simplex of{7}, it contains the 
closure of this simplex. It remains to prove that the closure of the simplex 
T}, is contained in the same convex set TZ). Let i=1,2,---, 4, 
be the carrier of T3~‘ in Ka. Since Tz is the carrier of Tz and T?=08T3 STs, 
we have 


To 


a 


whence it follows that 77, Tz’, - - - , Tz*and therefore the centers of TCTz, 
Ts are on Ta); since TZ) is convex, the 
closure of the - - - >7 | is a subset of Tz) 
and 49.2 is proved. 

It follows from 49.2 that the prism with the bases 2, and ofjzf, (see sec- 
tion 28) lies on a polyhedron ICT’, (composed of the simplices of this prism) 
and thus #, cofiz, on ICT, (this homology being understood in some in- 
variant sense—either as a continuous homology in the sense of [4, p. 335] 
or as a homology between convergent cycles [5, pp. 240, 241]). If we take 
%, from (49.2) we obtain 


On the other hand, we have by (47.1) 


q@ap 
sarDete = 


thus s%,D%22 (considered as a continuous or asa convergent cycle) lies on II 
and therefore 


= ete on II. 
Hence we may replace in (49.3) the chain of{s§,Dimg22, by s$,D!22, which gives 
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By the mapping C of 49.1 this homology turns into 


Blia@ap 
© On I'g;, thus on Gg, 


which completes the proof of the combinatorial lemma, as well as the proof 
of the general duality theorem of Alexander-Pontrjagin. 


§15. THE REMOVING THEOREM 


50. The removing theorem in the case of a manifold. A V-cycle 22 on K. 
is homologous on K, to a cycle on G, if and only if the A-cycle Dz is ho- 
mologous on K,* to a cycle on G,*. As the scalar product of 22 with a A-cycle 
22, on A, is equal to the intersection number of Dfz2 with the same 2%) we 
get the following form of 


THE REMOVING THEOREM FOR MANIFOLDS. A 6-class 2° on the (n-dimensional 
orientable) manifold K contains as a subset a 5-class on the given open set 
Ir'=K\A if and only if, whatever be the thread 2 = {32,} CA? and 2.Cz°, the in- 
tersection of # with any 2oC€ixo 1s equal to zero (here, as usual, the 5-classes 
are taken with respect to a discrete coefficient-group X, while the coefficient-group 
in is X). 


Another formulation of the same theorem is: 

A q-dimensional convergent cycle z* (with respect to the discrete group X) 
on K is homologous on K to a cycle on I if and only if the intersection of z¢ 
with each p-dimensional convergent cycle on A (with respect to Z| X) is equal 


to zero. 
REMARK 1. The theorem holds also for X = Z= (see section 5). 
REMARK 2. For non-orientable manifolds the theorem holds in the special 


case X = Z=_73. 
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INDEPENDENT INTEGRAL BASES AND A CHARAC- 
TERIZATION OF REGULAR SURFACES 


BY 
H. T. MUHLY(') 


Introduction. The fundamental paper Theorie der algebraischen Funktionen 
einer Verdnderlichen of Dedekind and Weber opened a wide field of research. 
The methods which these authors brought into play in the study of algebraic 
functions of one variable have lent themselves readily, from the conceptual 
point of view at least, to generalization to the case of several variables. How- 
ever, even if one restricts himself to the case of algebraic functions of two 
variables he finds a sharp line of demarcation in the analogy with the one 
variable case when he attacks problems of enumeration such as the problem 
of Riemann-Roch. 

Several years ago Zariski made the conjecture that the analogy between 
the theory of algebraic curves and the theory of algebraic surfaces would 
extend much further in the case of regular surfaces, that is surfaces whose 
arithmetic and geometric genera coincide. The objective of this paper is to 
establish the truth of this conjecture and to show that this fact is in a sense 
characteristic of regular surfaces. , 

A tool which proved to be most useful to Dedekind and Weber in deriving 
the Riemann-Roch theorem as well as other allied results was furnished by 
their so-called normal bases, the construction of which we shall describe briefly 
below. 

Let = be a field of algebraic functions of one variable x cover a field of 
constants K, which is assumed to be maximally algebraic in 2 and of char- 
acteristic zero. The field 2 is then an algebraic extension of K(x) of finite 
relative degree v. Let 0 denote the ring of integral functions of x in = and 0’ 
the ring of integral functions of 1/x in 2. If w is an element of 0, the “expo- 
nent” of w (in symbols exp w) is defined to be the smallest integer r such 
that w/x’ is an element of 0’ (that is w/x’Co’, w/x’-'{o’). A set of functions 
Ae, Ay is constructed as follows. Let A\1=1, and let be an element 
in 0 of lowest exponent 72, such that Az does not satisfy a congruence of the 
form \2=cA1(ox) where cE K. If Au, As, - - , Axa have been selected, take 
to be an element in 0 of lowest exponent which does not satisfy a congruence 
of the form EK, j=1, - - - , Dedekind 
and Weber show that this construction leads to a set of v( = [2: K(x) ]) func- 
tions Ax, Az, - - - , Ay in o which have the following properties: 


Presented to the Society, September 13, 1943; received by the editors September 10, 1942 
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(a) They are linearly independent over K(x) and therefore form a modu- 
lar basis for 2 over K(x). 

(b) They form a modular basis for o over the polynomial ring K [x]. 

(c) If exp A;s=r;, then ri(=0), 1SreS --- Sr,. If w is in o, and if r, is 
the last integer in the sequence 7, fe, - - - , 7, which is less than or equal to 
exp w, then w is of the form w=P;(x)Ai+ - - - +Pa(x)As, and the degree of 
P(x) (a polynomial) is less than or equal to (exp w—r;). 

(d) The quantities =\,;/x"‘, i=1, 2, ---, form a base for 0’ over 
K[1/x] with the same properties relative to 0’ as the quantities Xu, -- -, », 
have relative to o. 

(e) =p, the genus of 2. 

Dedekind and Weber refer to such a set of functions as a normal base for 0 
over K [x]. 

When we pass to the case of several variables, x1, x2, - - - , x,, and let = 
denote an algebraic extension of K(x1, x2, - - - , X-), we must consider together 
with the integral closure 0» of the polynomial ring K [x1, x2, - - - , x,] the inte- 
gral closures 0; of the polynomial rings K [x:/x;, - , xi-1/xi, 1/24, 

-+, x,/x;], i=1, 2,---, 7, if we want to generalize the concepts men- 
tioned above. It will be pointed out in §2 that it is possible to define an 
integer, exp w, for every element w in 09 (or in 0;) with the property 
that if expw=h then w/x?Co,, i=1, 2,---, A set of 
v(=[2:K(x1, x2, ---, x-)]) elements, Ax, Az, A» in 09 which form a 
modular base for 09 over K[x:, x2, - - - , x-] (and consequently form a base 
for 2 over K(x1, x2, - , will be referred to as a DW-base for 0» rela- 
tive to K [x1, x2, - - + , X,| if in addition it satisfies the following conditions. 

(a) If exp A;=r; then r:(=0), 1 Sr2Sr3S --- Sr,, and if wis in 09 and r, 
is the last integer in the sequence 7; $723 - - - Sr, which is less than or equal 
to exp w, then w is of the form w= P,(x)Ai+P2(x)Ae+ - - - +Pa(x)A, where 
the polynomial P; is of degree less than or equal to (exp w—7;). 

(b) The quantities =),/x}', 7=1, 2,---, 2,- have the 
same properties relative to the ring 0; as the elements Ax, Az, - - - , A, have 
relative to 0o. 

In §2 we show that if &¢*, &*, - - - , &* are the homogeneous coordinates 
of the general point of an arithmetically normal variety V, in P, and if every 
element of the integrally closed ring 0* =K[é#*, &*, - - - , &*] depends in- 
tegrally on the algebraically independent elements £¢*, &*, - - - , £* then the 
existence of a DW-base in 090 =K - - - , relative to the 
polynomial ring K - - - , is equivalent to the existence 
of a modular base for o* over K[é«*, &*, - - - , £*] consisting of linearly inde- 
pendent homogeneous elements. 

It is well known that such modular bases do not exist in general in rings 
such as o*, so that DW-bases likewise do not always exist. 

In this paper we shall prove that if &*, &*,---, &.* are homogeneous 
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coordinates along a nonsingular, normal model of a field = of degree of 
transcendency two relative to K, then the existence of a modular base for o* 
over K #*, &*] consisting of v( &*/E*) ]) linearly independ- 
ent homogeneous elements implies that the field 2 is regular. Moreover, if F is 
a nonsingular model of a regular field = and if 0;* is the ring of homogeneous 
coordinates along a derived normal surface F of F belonging to a sufficiently 
high character of homogeneity & [9 ](?), then o,* has an independent homogene- 
ous modular base relative to a suitably selected set of independent variables. 

The proof of the first part of this statement, to the effect that the exist- 
ence of a base implies regularity, is completely arithmetic in nature. For the 
proof of the converse, however, we found it necessary to use a lemma of 
Severi [6] which this author uses in his derivation of the Riemann-Roch 
theorem for surfaces. The proof of the converse by means of totally arith- 
metic arguments presents auxiliary problems requiring considerable investiga- 
tion in themselves. We have not as yet been able to cope with all of the diffi- 
culties arising in connection with them. 

Throughout this paper we shall use the term normal to mean arithmeti- 
cally normal. The coefficient field K is always assumed to be algebraically 
closed and of characteristic zero. 

1. Definitions and preliminary remarks. Let = be a field of algebraic 
functions of degree of transcendency r relative to a subfield K. By a prime 
divisor of = we shall mean any (r—1)-dimensional valuation of 2. If ¢& is 
any integral domain in 2, and if § is a prime divisor whose v-ring 8($) con- 
tains $, then § is said to be of first or second kind with respect to $ accord- 
ing as the center(*) of B in & is or is not (r —1)-dimensional. 

If - - -, &* are the homogeneous coordinates of the general point 
of a projective model V, of 2, and if o*=K[é#*, &*, - --, &*] is the asso- 
ciated integral domain, then according to a well known theorem of Noether, 
a nonsingular projective transformation can be found which will insure that 
the first r+1 £*'s, &*, &*, - - - , &* are algebraically independent and that the 
remaining ones, £41, - - - , £,.* (and hence the entire ring o*), will be integrally 
dependent on &*, &*,---, &*. If, then, we specialize o* to the various 
j=0, 1, 2,--+, 7, it follows that in each the first r coordinates are alge- 
braically independent and the rest depend integrally on them. It is well known 
that if % is a prime divisor of 2 such that B(%) contains two of the rings 0;, 
Say 0a, 0g B(P), and if $ is of first kind with respect to 04, then § is also of 
the first kind with respect to 0s. Consequently, one may speak of a prime 
divisor as being of first or second kind relative to a given projective model V, 


(?) Numbers in brackets refer to the bibliography at the end of this paper. 

(*) By the center of $ in $ is meant the prime ideal in 3 consisting of those elements 7 
such that v($, ») >0. Throughout this paper we use the notation v(f, x) to denote the value 
of x in the valuation . 
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of 2. Thus we may associate with 'a model V, of = a set ©(V,) of prime di- 
visors, consisting of all those prime divisors which are of first kind with re- 
spect to V,. 

Using the elements of 6(V,) as generators we form the abelian group 
@(V,) which consists of all finite, formal, power products PB"Py - - - Prt 
where $;€ S(V,) and a; is an integer, i=1, 2, - - - ,&. The elements of G(V,) 
are called divisors. The divisor &=]]*_,B2* is said to be integral if a;>0, 
#=1,2,---,k. 

With every element 7 in 2 one can associate a uniquely determined di- 
visor in G(V,), 


A(n) II Pr, »(n) ae o(f, n), 


where the product is extended over all prime divisors of the set S. (When 
no ambiguity will result, we shall write 7 instead of %(n).) Those divisors 
which are associated in this way with elements 7 in 2 form a subgroup § of 
@(V,). Two divisors % and % are said to be equivalent if A-BV-"*E H. 

With every prime divisor $ in S one can associate a unique, irreducible, 
(r —1)-dimensional subvariety, V,1(%), of V,. With every integral divisor A 
in G@(V,), say X=] we associate the effective subvariety 


V(X) = A = a V(Pi) + a2V(P2) + - - - + 


The complete system determined by & (or A) is, then, the set of all effective 
subvarieties V (%{’) which are associated with integral divisors YA’ which are 


equivalent to %. 
In what is to follow, we shall need the following lemma, proved in [4]. 


LEMMA 1.1 The algebraically independent quantities &, &,---, & 
2, - - - , r) have divisor representations of the form &;=A;/Ao, 
4#=1,2,---, 7, where A; and Ao are integral divisors and no prime divisor di- 
vides both A; and Ao. 


The following lemma is also needed. 


Lemma 1.2. If B ts a prime factor of Ao, say Ayo=P°As where Ag is prime 
to and if &, ---, &) ts any polynomial in &, ---, & of degree h, 
then o(f, f) = —hB. (The notation is that of the previous lemma.) 


Proof. Assume first of all that f=@,, a homogeneous polynomial in 
&,-- +, & of degree h, so that =o (1, - - - , This ele- 
ment is in 01, and since § is of first kind, and at finite distance with respect 
to 0:1, the elements £/£1, --- , &-/& must be algebraically independent 
modulo the center of in Hence =0 and oa) £1) 


= 
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If f is an arbitrary polynomial then the lemma follows for f also since 
o(P, a+b) =min (o(F, a), v(P, b)) if o(PB, a) 5). 

It is not difficult to see that if w is any integral function of &, £, ---, & 
then w admits a divisor representation of the form 


w = 


where % is an integral divisor and & is an integer. In fact, if w is an integral 
function of £1, &,---, &, then w has non-negative value at any prime di- 
visor at which the values of £1, £, - - - , £,are non-negative. Conversely, any 
element w in 2 which admits a representation of the form w=%/%} is an in- 
tegral function of &, &, - - 

2. Exponents of integral functions. We now show that the Dedekind- 
Weber method of assigning exponents to integral functions can be carried 
over to the case of functions of several variables almost verbatim. The real 
significance of these exponents is intimately connected with the notion of 
arithmetically normal varieties, and we shall treat the subject from this point 
of view. 

Let - - - , &* be the homogeneous coordinates of the general point 
of a variety V, in the projective space P,. It is assumed that V, is arithmeti- 
cally normal so that o* =K[é#*, &*, - - - , &*] is integrally closed in its quo- 
tient field >*. As in the preceding section we assume that - , are 
algebraically independent and that o* depends integrally on these r+1 ele- 
ments. We again consider the rings 00, 01, - , Or, 0s = K - - -, 
&*,/t*,---, &*/&*]. If X is any element in oo, then A is a polynomial 
S(&, En) (Es=EF/EM) Of degree h, say. Consequently, &**-A=X* is 
an element of o* which is homogeneous of degree h. It is pointed out in [9] 
that any element A* in o* which is homogeneous of degree h satisfies an equa- 
tion of the form 


where a; is a form in é*, &*,---, &* of degree h-t, i=1, 2,---, v 
&*)]). If we divide equation (2.1) by &*, 
+=0,1,---,7, we find 


In other words \*/£?*(=X/#) is integrally dependent upon 0; and hence 
is in 0;. We thus see that given any element \ in oo there exist integers’ 
such that A/#Co,, «=1, 2,---, 7. Clearly, if this is true for one integer h 
then it is true for any integer k greater than h. If for some 4, the integer h is 
such that A/Co,, then it is also true that \/£?Co,, for any j=1, 2, - - - r. 


In fact, since A is in 09, we have 
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(2.2) d = 


where % is integral, and 
(2.3) = (B/Mo) - = 


If & is the smallest integer such that (2.2) holds (that is, if 8 is not a multiple 
of Mo), then the fact that \/£Co,; implies that h=k, for \/& Co, implies that 
d/# may be written in the form 


(2.4) = M/A; 


where QM is integral. Equations (2.3) and (2.4) then imply that &5-'-W?-M 
= $Y and, since A; and MW, are relatively prime, k Sh for B is not divisible 
by %». Since we can represent A in the form \=%’/%}> it follows that 
=B’/U}, and hence that A/#Co,. 

We therefore define the exponent of the function \ (notation: exp A) to be 
the smallest integer 4 such that \/£Co;. The exponent depends only upon A 
and not upon the particular &; which we take to define it. In the case of one 
independent quantity, &, this definition is precisely that of Dedekind-Weber. 

Several obvious, but important, properties of exponents suggest them- 
selves immediately. We list them below. 

(a) exp A\=h if and only if X=B/U}, where B is an integral divisor not di- 
visible by Ap. 

In fact, if \=B/%> then clearly expASh, but if expA=h’<h, then 
\=%’/%*’ as has been shown. This implies that B is divisible by Mo, which 
is not the case. 

(b) As a corollary to (a) we have exp \=0 if and only if KEK. 

(c) If n and £ are two elements of 00, then exp (n+) Smax (exp 7, exp $), 
and if exp n>exp ¢ then exp (n+$) =exp 7. 

If exp »=h and exp {=s, and if h=s then n/# and ¢/# are in 01. Hence 
exp (n+¢) Sh. If h>s, then (n+£)/# 7" is not in 0, for if it were then we would 
have n/£"'=(n+¢)/£*!—£/#"' in 01, which is not the case. 

(d) If and exp A\=h, then d is a polynomial in - - , En of degree 
less than or equal to h. 

This statement holds in view of the fact that V is a normal model of 2 
and hence the system of hyperplane sections on F and all of its multiples are 
complete [4]. 

(e) If &, - - En) ts polynomial of degree h then exp A Sh. 

(f) If n and § are in 0o, then exp (n-{) Sexp.n+exp ¢. If 7 is a polynomial 
in the independent elements £1, £2, - - - , & only, then exp (n-{) =exp n+exp ¢. 

The first statement follows directly from (a); the second fellows from (a) 
in view of Lemma (1.2). 

(g) If Ao is a prime divisor B then the equality sign always holds in (f). 
Moreover, exp X= —v(§, A). 
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(h) Jf - , an element of o* of degree of homogeneity 
h then exp X (A=¢/E¢**") is less than h if and only if $ is divisible by Ex in o*. 

Assertion (h) follows almost immediately from (d) and (e). If expA=k<h 
then, by (d), \ is a polynomial of degree less than or equal to Rin &), £2, - - -, En; 
h=f(éi, &, -- +, say. Consequently y=é&**f is an element of o*, and 
On the other hand, if in o*, then 
=’ /Es*-* =g (£1, £2, - - - , &) where g is a polynomial of degree h—c. By (e) 
this implies exp \Sh—c. 


THEOREM 2.1. If 0* has an independent modular base over K E*, - - -, 
consisting of v homogeneous elements d*, - - , - - -, &)] 
then 09 (and of course also 0;, 7=0, 1,---, 7) possesses a DW-base over 
K [&:, - - &] and conversely. 


Proof. Assume that the elements A*, A#*, - - - , A,* are arranged in the 
order of increasing degree, so that if A* is homogeneous of degree r; we have 
rises --- Sr,. If w* is any homogeneous element of o* of degree h, we have 


(2.5) w* = + +--- + C , 


If P*=)°P%, where P% is homogeneous of degree j, it follows immedi- 
ately by an application of the automorphism 7:é*-t&* of Z* over = 
E*,--:, Ek) that P§=0 if and that P*=Pj_,,. In 
other words, in the representation (2.5) of the homogeneous element w*, the 
coefficient of is homogeneous of degree h—7;. 

It follows from the above remarks that none of the elements \¥ is divisible 
by &¢* in o*, In fact, suppose A* = &j*w*. Since w* is of degree r;—1 (we as- 
sume for the moment that 7>1) we have - - - and 
hence - - - where QO? = This is in contradiction 
with the independence of the elements A¥. In particular, A#* must be a con- 
stant, which we may take to be 1. ‘ 

If we let A; =AF/Ec*%, it follows that A; is an element of 09 of exponent r;. 
If w is an element of 09 of exponent h, then &**w is homogeneous of degree h, 
and is in o*(*). Hence we have &**w = PfAS+PPAs + - - - +P*A* where P¥ 
is homogeneous of degree h—r;. Consequently, we have w=)-PAr,, where 
P;=P}/ts*-+. This implies of course that exp P;( =degree of P;) is less than 
or equal to h—7;. 

Since this argument could be carried out with any of the rings 0; as well 
as with 00, it follows that the elements Au, As, - - - , A, have the properties of a 
DW-base. (See introduction.) 

Conversely, if Ax, 2, - - - , A, form a DW-base in 0p relative to the ring 


(*) This assertion follows by property (d) above. In fact, since exp w= h, w is a polynomial 
of degree hin &, &, and hence is a form of degree hin 
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K[&:, &, - --, &] and if exp \;=7; it follows immediately that the elements 
\* =£ A; form a modular base for over t*,---, &], 

3. Differential divisors and the geometric genus. We shall henceforth as- 
sume that we are dealing with a field 2 of degree of transcendency two over 
the coefficient field K. We use the same notations as before except that we 
put r=2. We select the quantities &¢*, &*, &* in such a way that in addition 
to our previous requirements we also have [2:K(é, )]=v, the order of our 
model F. (We put & 7=1, 2, ---, m.) 

Let A be any subfield of 2 such that 2 is algebraic over A. If $ is any prime 
divisor of 2 with valuation ring %, then 8, = S/(/A is a valuation ring in A 
which defines a prime divisor B, in A. The divisor 4 is the valuation which 
induces in A. If p, is the ideal of non-units in Ba, and p» is the ideal of non-units 
in %, then the extended ideal p,-% is a power of p, say pa-G=p*, e=1. The 
divisor [ is said to be ramified with respect to the field A if e>1, unramified 
if e=1. The integer e—1 is called the ramification degree of $ relative to A. 

We restrict ourselves to the case in which A= K(&, &). We consider the 
ideal (Af/dz),.. [9], where w is an arbitrary element of 
and f(z; £1, &)=N(z—w). If pi, pe, ---, pe are the one-dimensional prime 
ideals in 09 which occur in the decomposition of Z;¢,,¢,) then we have 


(3.1) © Pi De 


where the symbol ~ denotes quasi-equality in the sense of van der Waerden. 
It is well known that if $ is ramified with respect to A, and if the v-ring S 
of $ contains 0o, then p, the center of $ in 09, is one of the primes )y, Po, - - -, De 
and conversely. Moreover, the ramification degree of $ relative to A is pre- 
cisely the exponent @ assigned to its center in the decomposition (3.1). We 
conclude therefore that there are only a finite number of prime divisors in 
the set S(F) which are ramified over the field A (=K(é,, £)). 

The assertions of the preceding paragraph are all consequences of the 
following lemma. 


LEMMA 3.1. If 2 is any prime divisor of the field = (at finite distance rela- 
tive to the ring 00) of ramification degree e—1 relative to K(&1, 2), then there is a 
primitive element 0 in 09 such that f'(@) =0 (pe), f’(0)40 (p™), where p is the 
center of B in 0o, and f'(0) =(Of/0z).u9. Moreover, it may be assumed that 0 


Proof. The proof is similar to the considerations set forth in [10, para- 
graphs 11, 12, and 13]. We assume that &, is transcendental mod § so that 
the residue field of § is algebraic over the field K(£). We let L denote the least 
normal (Galoisian) extension of K(é) which contains the residue field of $B. 
Together with 09 we consider the rings of =K(£)[f, &,---, &] and 
Oo=L és, - - -, En]. In view of the fact that is transcendental mod p 
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there is one and only one prime ideal »’ in of such that p’/\0o= p. Moreover, 
if &’ is the quotient ring of of at p’ and §& that of 09 at p then clearly ¥ = 3’ 
and consequently the ramification degree of $ as a “point” of the one-di- 
mensional variety corresponding to og is the same as the ramification degree 
of $ as a “curve” of the variety belonging to 09. Now if a1, a2, - - - , a, are 
the prime ideals in Oo which lie over p’, that is, as\oj =p’, then Dop’ 
= 2, - -, aa]. Moreover, it follows immediately from the first part of 
the proof of Theorem 7 in [10] that if m is any non-negative integer then 
= p’™(5). 

Consider the field L(&)>K(é1, &), and let P be the divisor induced by $ 
in K(é&, &). The divisor U; in L- 2 =A defined by a; in Oo induces a divisor A; 
in L() which lies over P in K(é, &). Let p, a; be the centers of P and A; 
in their respective v-rings, and let <4; be the center of W; in its v-ring. If R; 
is the v-ring of U; then we assert that a;-R;=c4{ where e—1 is the ramification 
degree of $ relative to K(é, &). In other words, the ramification degree of A; 
relative to L(#) is the same as that of § relative to K(é, &). To see this, let R 
be the v-ring of P in K(é, &) and let S; be that of A; in L(). Nowa;=pS; 
and hence a;-R; = p- R;. On the other hand, R; = and p- Ri = (p- F’) -R; 
=cA’*-R; where denotes the center of in Since 4’ - R; we have 
cA’*-R; This implies the assertion. 

The lemma under consideration is well known in the case of functions 
of one variable. In fact, if --- is a primitive 
element of A over L(é) then for non-special “constants” c;, we will have 


(0) =0(as~'), (af). Since the values of c; that must be avoided are 
those satisfying certain algebraic relations, we can, in particular, find such 
quantities in K. For such a choice of the c’s, 8 will be an element of 09 and so 
also will f’(@). This proves the lemma in view of the fact that af’(\00=p™, 
which holds, in particular, for m=e—1, e. 

The ramification divisor 3, of = relative to A is defined by the relation 


(3.2) Ba II S, 


where e—1 is the ramification degree of § and the product is extended over 
all prime divisors of the set S(F). 34 may of course contain prime factors 
which are also factors of Wo. (Wo is the common denominator of & and & 
mentioned in Lemma 1.1. It is given by the curve £>* =0 on F.) 

We let © denote the complementary module of 0» relative to K(é:, &). 
consists of those elements in 2 such that 'S(¢«w) is integral for every w 
in 09. S(ew) denotes the trace of ew relative to K(&, &). The module Gp is of 


(®) The methods of [10, Theorem 7] are applicable here in view of the fact that as a mini- 
mal prime ideal of the integrally closed ring 0o, p defines a simple subvariety of the model V; 
associated with 0» and hence p’ defines a simple subvariety (point) of the model V/ associated 


with 03. 
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paramount importance in the theory of ramified divisors. We prove that &» 
is the inverse system of the ideal Z;¢,,¢,). To this end two lemmas are needed, 
the first of which will be used later in another connection. 


Lemma 3.2. If A=K(&, &) and $ is any prime divisor of the set S(F), 
and if Be, - - - , Be are the complete set of prime divisors in which 
induce the same valuation in A as does ¥, then for any function n such that 
o(B;, n)>0, 2,---, it ts true that v(B, S(n))>0, where the trace is 
taken relative to A. 


Proof. Let P be the common contraction of $1, Bz, - - - , $, with A and 
let 2 be the least normal extension of A which contains 2. If Pi, Pe, ---, P, 
denote the various valuations of 2 which induce the valuation P on A, then 
each of these induces one of the valuations ; on 2,i=1, 2, - - - , &. The inter- 
section B,/\B,/\ - - - (\8, where 8; is the v-ring of P; is the integral closure 
(in Q) of the v-ring B of P in A [3]. We denote this intersection by D. The 
ring D contains s distinct maximal prime ideals ai, a2, - - - , ag where a; con- 
sists of those elements 6 in D with the property v(P;, 6) >0. By hypothesis, 
n is contained in the intersection a=[a:, az,---, a]. Since the ideals 
Qs, G2, * - - , @, are conjugate over A [3], the intersection a is an invariant ideal, 
and hence not only 7 but also all of its conjugates are in a. Hence their sum 
S(n) is in. a. Since a(\% is the ideal of non-units in 8, we conclude that 
o(P, S(n)) >0, q.e.d. 


Lemma 3.3. An element ¢€ of = is an element of the module &, if and only if 
o(P, €)= —e-+1 for every prime divisor of the set S(F) whose v-ring contains 0o 
(e—1 denotes the ramification degree of $). 


Proof. Let ¢ be an element of 2 with this property. If ¢€ is not an element 
of & then thére is an element w in 09 such that S(¢w) is not integral. If we re- 
strict our attention to the group @, of divisors in A generated by the prime 
divisors in A of first kind with respect to the rings K[é1, &], K[1/&, &/é&], 
and K [£:/:, 1/2], then S(ew) will have a decomposition of the form 


sao = (Hh) 


where a;, 8;>0, o is an integer, P; and Q; are distinct prime divisors in A, 
and A, is that divisor in A which correspords to Mo in 2, that is, £;=A;/Ao, 
¢=1, 2. 

We fix our attention on one of the divisors Q;, say Q,. If gq: is the center 
of Q; in R (=K[&:, &]), then g: is a principal ideal, say g.=R-¢, where ¢ is a 
polynomial in £1, &. We consider the extended ideal 099: which has a decom- 
position of the form og: = - - , By definition, it follows that 
fi—1 is the ramification degree of the p-ime divisor of 2 determined by 4. 


= 
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Now by our hypothesis we have v(p;, ew) = —f;+1+-f; (since v(p;, w) 20), 
and hence v(p;, ewh)>0, i=1, 2,---+, s. By Lemma 3.2 it follows that 
v(Qi, S(ewp)) >0, or v(Q:, S(ew))+1>0, and hence we have v(Q;, S(ew)) 20, 
which is a contradiction. We therefore conclude that ¢ is an element of Gp. 

Conversely, it is well known (see for example [5]) that for any primitive 
element @ in 09 the relation f’(6)-€oCoo, where f(z) =N(z—8@), holds. More- 
over, by Lemma 3.1 it follows that for any prime divisor $ with center p 
in 09 and ramification degree e—1 there exists a primitive element @ in 059 such 
that f’(@) =0 (p“-”), f’(@) 40 (p™). From this it follows that for any € in &> 
the inequality v(%, €)= —e+1 holds. This proves the lemma. 

This characterization of €) in terms of the ramified divisors at finite dis- 
tance relative to 09 (that is, having v-rings which contain 09) is obviously 
equivalent to the statement that @ is the inverse system of Z;¢,,¢,) in Z, in 
view of the relation (3.1). Making use of this fact, it is not difficult to see that 
every element ¢ of &) admits a divisor representation of the form 


(3.3) € = (D/3Bs)Mo 


where D is an integral divisor and r is an integer which may be positive, 
negative, or zero(*). Conversely, any element of 2 which admits a representa- 


tion such as (3.3) is in G». 
An integral divisor D equivalent to the divisor 3,%r* (in symbols 


D~3ZsUc*) is called a differential divisor of first kind. To every such di- 
visor D there corresponds an element 4, necessarily in ©, which expresses 


the equivalence, namely, 


(3.4) 5 = D%/Ba. 


The geometric genus ~, of the surface F (a normal nonsingular model) 
is defined as the number of linearly independent differential divisors of first 
kind, that is the number of linearly independent elements 6 in &, of the form 
(3.4). A curve K on F determined by a differential divisor of first kind is 
called a canonical curve, and the complete system |K| is called the pure 
canonical system on F. The effective dimension of | K| is p,—1. 


(*) Equation (3.3) follows immediately if none of the prime factors of Wo is ramified. 
Otherwise, the proof is as follows. If 8 is any prime factor of Yo, say Wo= B*- As , where Ws is 
prime to $, then e—1 is the ramification degree of $. Since $ is at finite distance relative to 
the ring 0; (= K[1/&, , &n/&]) it follows that there exists an element @, in 0; of ex- 
ponent one, such that f’(@;) =0 (p‘*-»), f’(@:)#0 (p) where p is the center of $ in 0;. Since @, 
is of exponent one it follows that 0= £6; is in 0o. A simple computation leads to the conclusion 
that o(f, f’(0))= —e(v—1)+e—1, so that in the representation f’(0)=23,4/Wo" the divisor 2 
doesnot have asa factor. Since f’(@)=M/ a, where is integral, we have e= (M/23a)Ao*. 
Applying this process to each of the factors of %o in turn, we find that e= (M/U3s)%, where U 
has no factors in common with Wo. It then follows by Lemma 3.3 that 9% contains U as a factor. 
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As it stands, our definition of differential divisors of first kind and of the 
geometric genus ~, is but a formal generalization of similar definitions given 
by Dedekind and Weber in [2]. However, it is not difficult to see that if one 
considers a generic projection of our normal nonsingular model F into a sur- 
face in S; given by f(&, £, 6) =O where - - - +¢n€, then the ele- 
ments 6f’(@) will be adjoint polynomials, ¢,4 of degree v—4, to the surface 
f(&:, &2, 8) =0. Hence since 5=¢,_,/f’(@) the p, independent elements 6 deter- 
mine the p, independent double integrals of first kind attached to F. 

4. The arithmetic genus. Let xo, x1, - - - , x, be indeterminates and let P 
denote the prime H-ideal in K[xo, x1, ---, x»] which determines a nor- 
mal, nonsingular model F of 2. In other words, P is a prime ideal in 
K [xo, x1, , Xn] such that K [xo, x1, - - , x, |/P2¢o* and under the 
isomorphism. Van der Waerden has shown [7] that if x(P, h) denotes the 
number of homogeneous elements of degree h in K[xo, x1, - - - , x2] which 
are linearly independent (over K) mod P, then for large values of h (that is, 
for h>ho, where hp is a fixed integer depending on P) x(P, h) is given by 


(4.1) x(P, h) = + + ae 


where do, a1, and az are integers independent of h. Van der Waerden showed 
further that a) =v, the order of the surface F. 

Formula (4.1) can also be derived by means of the Riemann-Roch theo- 
rem for surfaces, with the help of Noether’s formulae for the genus and order 
of a composite curve [8]. The theorem of Riemann-Roch states that if | D| 
is a complete linear system on a surface F then its dimension r is given by the 
formula r=n—2+p,.+1—i+s where n, 7, and 7 are respectively the degree, 
genus, and index of speciality of |D|, and p, is the arithmetic genus of F. 
The non-negative integer s is called the superabundance of the system, and 
the difference s—i is called the irregularity of |D|. The system |D| is said 
to be regular if s=7=0. It has been shown by Severi, that ff lc | is a non- 
singular system of dimension r23, then for large values of & the system 

Cal = |hC | is a regular system. Applying the Riemann-Roch theorem to 
C,|, and denoting its dimension, degree and genus by pa, v, and 7, respec- 
tively, we have 


(4.2) Pr=m— 1. 


If F is a normal, nonsingular model of the field 2, and Ic | is its system 
of hyperplane sections, then lc | is nonsingular, and the complete system 
|kC| is cut out on F by the hypersurfaces of order h in its ambient space [4]. 
Hence in this case, x(P, h) = p,+1. A straightforward application of Noether’s 
formulae to formula (4.2) yields 


(4.3) x(P, h) = (h(h — + 
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where » is the order of F and 7 is the genus of | Cc | (7). 

As yet, no purely arithmetic proof of formula (4.2) is available. For the 
purposes of this paper we shall regard formula (4.3) as the definition of p,(°). 

5. The existence of a DW-base implies p, = p.. Let o* =K -- -, &,*] 
where &¢*, &*, - - - , &* are the homogeneous coordinates along a normal non- 
singular model F of the field 2. We make the following assumptions: 

(a) £*, tf, &* are algebraically independent and every element of o* de- 
pends integrally on them. 

(b) There exist »y (=order of F) homogeneous elements A¥*, As*, - - - , A* 
in o* linearly independent over K [é*, &*, &*] with the property that ifw* 
is any element in o* then w*=P?A*+PFAF+ - - - +P¥AF, where P# is a 
polynomial in £¢*, &*, &*. We let r; be the degree of A¥, and we assume as we 
did in Theorem 2.1 that 1SreSr3S --- Sr, 

In the course of the proof of Theorem 2.1 we saw that under the above as- 
sumptions we could assert in addition that if w* is a homogeneous element of 
o* of degree h, and if, say, 7, is the last integer in the sequence 71, 72, - - -,7,such 
that h2r, (that is, h<7.4: if is defined), then w* = PPAF +PHAF+ - - - 
+P where P¥ is a homogeneous form in &*, &* of degree h—71;. 

The function x(P, 4) mentioned in the preceding section gives the number 
of homogeneous elements in o* of degree #4 which are linearly independent 
over K. If in particular 4>r, we know that any homogeneous element of de- 
gree h can be written in the form - - - +P*A* where P# is 
homogeneous of degree h—1;. Since As*, As*, - - - , A* are linearly independent 
over K [é#*, &*, &*] it follows that a linearly independent K-base for the 
homogeneous elements of degree h in 0* can be obtained by taking together 
all of the elements of the » sets of elements of the form a! where aY runs 
over a linearly independent base for the homogeneous polynomials in &¢*, &*, &* 
of degree h—r;. There are, then, (1/2)(4—1r;+1)(4—1;+2) such elements in 
the ith set {a’X#} and hence we have 


1)\(hk — 7; 2 


A simple computation (taking into account that r,=0) yields 


(") For a complete discussion of these questions see [8, chapter 4]. 

(8) It seems to us that from the arithmetic point of view, formula (4.3) is a very natural 
way in which to introduce the arithmetic genus. The question might be raised as to the neces- 
sity of requiring F to be nonsingular as well as normal in order that formula (4.3) hold. The cone 
projecting an elliptic cubic from a point not in the plane of the cubic furnishes an example of a 
surface in S; which is arithmetically normal but on which the complete system | Cc | cut out by 
the surfaces of order h in S; has a superabundance s=-+1 for every h. 
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1. 


Combining (5.1) and (4.3) we conclude that 


The proof that p,=, is completed by showing that , can also be ex- 
pressed in terms of the integers 7;, and that the expression one thereby ob- 
tains is p, =(1/2)>-7-3(r:—1)(r;—2). This is somewhat more difficult. 

As was pointed out in Theorem 2.1, if then Au, Ax, - - -, A, 
is a DW-base in 09 relative to the ring K [£:, &]. It is well known that there 
exist vy elements @,---, € in 2 satisfying the equations = 4;; 
(6;;=1, 6;;=0, These elements ¢€ are all in ©) (the complementary 
modulo of 09) and, in fact, form a modular base for ©» over K [£:, &], called 
the complementary base to Ax, Ax, -- +, A» The elements 4, @,---, € are 
of course linearly independent over K(é£:, &). If € is any element of ©, then 
we have €=a16¢:+a2@+ - - - +0,€,, a;CK[&, &], and it follows from the 
equation S(¢,A,;) = 6;; that the coefficient a; is equal to S(é\;). 

We assume, in particular, that € is of the form D%/ 3, and we assert 
that if $ is any factor of Wo, then v(f, He) >0. This is obvious if B is not a 
factor of 3,. If, on the other hand, 34=$*3’ (3’ prime to $), 8>0, then 
since f induces the divisor P in K(&, &) at which (1/£:) has positive value, 
it follows from the definitions of 34 and W» that o(f, 1/£) =8+1, and that 
Ay = J. We conclude that «)=3(6+1) —8=28+3 and hence that 
He) = 28+3—2(8+1) =1. 


Writing ¢ in the form =) a,e; we have 


(5.4) a;=S(d), = 


Since is an element of 0: (01=K [E#/é#*, its 
value at any of the prime factors of %» is non-negative, and hence the value 
of #ed;/#* is positive at any such factor. It follows by Lemma 3.2 that 
o(B, >0, and hence by (5.4), a:/é*-*) >0 for any such prime 
factor $. By Lemma 1.2, the inequality v(f, a;/#*~”) >0 implies that if r;<2 
then a; is identically zero, and if 7;>2 the degree of a; is less than r;—2. 

We prove the converse of this result. That is, if e=) a;e; is an element of 
©» such that a;=0 if r; 2, and for those values of « for which r;>2 the degree 
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of a; is less than r;—2 then the element ¢ admits a divisor decomposition of the 
form ¢=D%3/3s. We consider the set of elements A/, Ad, -- -, AZ where 
Af =\;/EF. As we have seen, these elements form a DW-base for the 
ring 01. It is well known that the base e/, «&, - - -, € complementary to 
Ai, Az, AY is related to the base &, €, - - - , €, by the equations e/ = 
These elements form an independent modular base over K [1/&, &/£] for 
the complementary module G, of 0; relative to the ring K [1/£1, &/& ]. 

Let $ be any factor of %o, say %o= PAs where Ay is prime to P. If e>1, 
we have $* as a factor of 3,4. Since § is a divisor at finite distance in the co- 
ordinate system which gives rise to 01, and since ¢€/ is in ©, it follows that 
ef )= —e+1. Hence we see that o($, (1/£:)e/ ) =e+(—e+1) =1. We thus 
have o(f, >0 and >e(r;—1). By our assumptions concerning 
the coefficients a; we have v(, a;) = —e(r;—3), and hence we conclude that 


v(P, a:€;) > 2e. 
The element a,e; is in o, and if a; is not identically zero we can write 


aye; = (M/Ba)B: Be 


where MM is an integral divisor and 1, Pe, ---, B, are the factors of Wp. 
If (e;—1) is the ramification degree of ; then by our above results we have 
$1—(e;—1) > 2e;, and hence s; = 3e;. Hence, since %o=] we conclude that 
the element a,é; is-of the form 


ae; = AoN/ Ba. 


The integral divisor ?t is therefore a differential divisor of first kind. The 
number of independent elements of the form a;€; in €) which give rise to differ- 
ential divisors of first kind is therefore zero if 7,2 and (r;—1)(r;—2)/2 
(=number of independent a; of degree less than r;—2) if r;>2. Since the ¢; 
form a base for ©, we conclude that 


(7, — — 2) 2. (rs — — 2) 


(5.5) => => 


2 


This completes the proof that the existence of a base implies p, = pa. 

6. A lemma of Castelnuovo. This section is devoted to establishing an 
important lemma due originally to Castelnuovo [1]. We consider a normal, 
nonsingular model F of 2, and we use the same notations as before. In par- 
ticular, we consider the various complete systems |C,| =|4C| cut out on F 
by the hypersurfaces of order h of its ambient space P,. 

The system | C,| cuts out a linear series g,, on a generic curve C of the 
system of hyperplane sections | C | . (In what sense C is generic will be specified 
in a moment.) The series g,, has a certa'm index of speciality 7, and a certain 
deficiency 6,. Clearly, if # is large we will have 1,=0. Moreover, since C is 


1943] INDEPENDENT INTEGRAL BASES 355 


generic in | C| it is nonsingular and hence locally normal. This implies that 
5, =0 for large values of h [12]. The lemma of this section asserts the follow- 


ing equality(*) 


(6. 1) = p (in 
h=1 

Proof. We subject the homogeneous coordinates £*, &*,---, &,* toa 
projective transformation §;=)>ai;£*, a;;@K. For non-special values of the 
a;; the property of integral dependence of o* upon the first three coordinates 
will not be lost. Moreover, since the system | Cc | is obviously not composite 
with a pencil, it follows by the theorem of Bertini-Zariski [11] that for non- 
special values of the constants a;; the curve defined on F by the equation 
,=0 will be irreducible and nonsingular. In fact, it is easily seen from 
Zariski’s consideration in [11] that for non-special constants -a;; the ideal 
o*£, will be a prime H-ideal in 0*. (This of course involves the fact that o* 
is integrally closed.) It is in this sense that we speak of the generic curve 
in | C|. We fix a set of constants a;; so that these requirements are satisfied. 
The characters 4, and 6, are understood to refer to the series g,, cut out on 
&=0 by | Chl. 

As before we let x(P, 4) be the number of independent (over K) homo- 
geneous elements of degree h in o*, and we let yu, be the number of homogene- 
ous elements of degree h which are independent (over K) modulo o*%&. It 
is not difficult to see that x(P, &)=patwact --- +e. In fact, let 
92:, - , be a set of u; homogeneous elements in 0* of degree which 
are independent modulo 0*f. The elements { ---, 

- , &3-0,,«, - - - } are obviously independent, and every homogeneous ele- 
ment of degree h in o* is a linear combination of these elements. 

By the Riemann-Roch theorem (for curves), we have nz.=kv—2+i;, 
—6.+1, k=1, 2,---, hk, where x is the genus of &=0, and since, clearly, 
we have 


(6.2) x(P, hk) = 1+ +1} 
kewl 


or on rearranging (6.2) we have 


h(h—1 
(6.3)  x(P, i) (és — +1. 


k=l 


Since >-}. ,(is— 8) remains constant for sufficiently large h we see, if we 
compare (6.3) and (4.3), that => 5,), q.e.d. 


(*) The original lemma of Castelnuovo [1] asserts that if one is given a system |I'|, then 
for high values of hk the superabundance of jar] has a constant value s, and s+pe=2_ (¢a— 6a). 


| 
| 
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7. On the indices i,. We use the same notations and assumptions as in 
the preceding section. The index of speciality, js, of the system | C,| is defined 
as the dimension, increased by 1, of the system |K—C,| where | K| is the 
canonical system on F. If |K —C,| does not exist we put 7,=0. We observe 
first of all that 7, =0 if 4 is sufficiently large. 

The lemma of Severi [6] mentioned in the introduction is to the effect 
that if h>v—4 then the canonical system |K | , of tt exists (that is, if p,>0), 
cuts out a complete series on a generic curve C, of | Cal . If p,=0, the characteris- 
tic series(*°) of | C,| is non-special. Since, in any case, the series cut out on 
|C,| by |K| is totally contained in the series difference of the canonical 
series on C, and the characteristic series on C,, it follows, by Severi’s lemma, 
that for h>v—4 the complete series difference between the canonical and 
characteristic series on Cy is cut out by the canonical system |X]. If 
in addition hy (>v—4) is taken large enough so that j,,=0, then for all 
h=ho we have that the index of speciality of the characteristic series of C, 
is precisely p,. 

The formulas of Noether [8] yield the following two formulas immedi- 
ately: 


(7.1) = hy, = hr, + — 1)/2)n, — — 1) 


where v, and 7, denote the degree and genus of | or | . From these one easily 
sees that if h is sufficiently large, the inequality »,>a,—1 will hold. 

Let p be an integer greater than or equal to 4» and in addition large 
enough so that for h2p, »,>2,—1. We pass from F to the derived normal 
surface ¥, of F by referring the curves of | C,| to the sections of F,.cut cut 
by the hyperplanes of its ambient space Py. Since F is nonsingular, so is F, 
[9]. If i, is the index of speciality of the series cut out on a generic C, by the 
system | C;,|, then 


(7.2) i, = py = t3=--- = 0. 


(The condition iz=i3= - - - =0 is implied by the fact that ky, >2mx,—2 for 
k=2, 3, ---.) A model Fof = will be called non-special if it is normal, non- 
singular, and if the speciality indices on its generic hyperplane section satisfy 
equations (7.2). 

We return to the notation of §6, but we now assume that the model F 
considered there is non-special. Combining (6.1) and (7.2) we find 


(7.3) D bs = Po — Pow 
h=l 


It follows from (7.3) that if, in particular, F is regular then 5,=0, 


(**) By the characteristic series of | C,| on Cy, we mean, of course, the series cut out on the 
curve Cy by the complete system | C;|. 
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h=1, 2,- +--+. In other words, if F is a non-special model of a regular field, 
then the systems | C,| cut out complete series on the general hyperplane sec- 
tion C of F. This implies that the general hyperplane section of F is arith- 
metically normal [4]. 

8. Construction of an independent base. Let 0* =K &*,---, 
where é¢*, &*,---, &* are homogeneous coordinates along a non-special 
model F of the field 2. We assume that the quantities &*, &* and &* are 
selected so that they are algebraically independent, and so that every element 
in o* depends integrally on them. The only other restriction which we place on 
the elements &*, &* is that the net =0 (the c’s are 
parameters which vary over K) be sufficiently general in lc | so that the 
general curve of N will satisfy the condition of the preceding section, that 
is, we require that the general curve of N be arithmetically normal. We let 
R*=K [&#, &*, &*] and we shall prove that o* has an independent modular 
base over R*. 

We apply a nonsingular linear transformation 1; +anti* +an2k*, 
ai;€K, i=0, 1, 2, if necessary, in order to insure that (a) the three ideals 
0*no, 0*1, 0*y2 will be prime in o*, (b) that these curves will intersect pair by 
pair in v distinct points of the surface F, each point of intersection being a 
simple intersection, (c) each of these curves will be arithmetically normal. 
Since &*, &*, &* are algebraically independent, it follows that the net N is 
not composite with a pencil, and hence by [11] condition (a) is satisfied for a 
general choice of the constants a;;EK. It is easily seen that if the coefficients 
a;; are sufficiently general, conditions (b) and (c) will automatically be satis- 
fied. We point out that since o* depends integrally on n*, ni*, ns, the ideal 
o* (ne, ni*, is projectively irrelevant, and hence none of the intersection 
points of 7:=0 and y2:=0 can be on the curve 7)=0. 

It should be observed that the transformation from &¥, &*, &* to mo, m1, 72 
does not change the ring R*, and hence we may assume without loss of gen- 
erality that the particular elements £¢*, £*, &* have all of the special properties 
which we have assigned to 70, 1, 72. 

We pass to the ring 00=K [&:, &, ---, &] where &=£*/* and we ob- 
serve that if %=oo(&, &) then the K-module 00/% is of rank v, the order of F. 
In fact if &= [q:, q2, - - - » Ge] is the primary decomposition of & (q; is a zero- 
dimensional ideal) then by assumption (b) we must have s=v and q;=};, 
#=1, 2,---, v, where p; is a prime zero-dimensional ideal. It follows im- 
mediately that U*=o*(é*, &*) admits a primary decomposition of the form 
= [p*, p*,---, p*, q], where p? is the homogeneous prime in 0* corre- 
sponding to the prime ); in 09. As an ideal in o*, p¥* is one-dimensional, but 
as a homogeneous ideal its dimension is counted as zero("'). The ideal q is a 


(8) For a complete discussion of H-ideals and their related affine ideals, see the first two 
sections of van der Waerden [7]. 
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possible embedded component of &* which is necessarily projectively irrele- 
vant, that is, its associated prime in o* is the ideal o*-(&*, &*,---, &*). 
The central point in preparing for the construction of a base is the following 
lemma. 


LemMaA 8.1. The fact that &* =0 is an arithmetically normal curve on F im- 
plies that U* has no irrelevant components. 


Proof. If we let $ denote the ring 0*/£* then since o*£* is prime, $ is an 
integral domain, and since £*=0 is arithmetically normal, $ ts integrally 
closed in its quotient field >. This implies that if §*—>£, mod &*, then the prin- 
cipal ideal $&, is an intersection of minimal ideals in %, and has no embedded 
ideals in §. Since U*/£* = YE, it follows that A* can have no embedded ideals, 
q.e.d. We can thus write U* = [p;*, p#, - - - , p*]. 

We now show that under the conditions we have specified above the con- 
struction of Dedekind-Weber can be generalized to the case now under con- 
sideration. Let a, we, - - - , w, be v elements of 09 which are linearly inde- 
pendent modulo &(=oo(é:, &)). If Ay is the smallest integer such that 
w* = "tw; is an element of 0* then w* is homogeneous of degree h;. Let us 
suppose that in o* a congruence of the form 


(8.1) + + --- + = 


is valid, where P; is a polynomial in £&*. Let mé** be any term of Pi, 
so that c,£s**wi* is homogeneous of degree s=k+/. The homogeneous com- 
ponent of degree s on the left-hand side of (8.1) will then be qé*wy* 
+ + -- - where is a term of degree s—h; 
in P,. Since is an H-ideal, we must have = A E* + BE#, A, BCo*. 
If A=An+tAmit +Ao and B=B,+Bnit+ ---+Bo, where A; and 
B; are forms of degree 7 in &¢*, &*, ---, &* then under the automorphism 
T: we obtain 


+ = (Anti + + t™(Anatit + But?) 
+ + Boks). 


Since this is an identity in ¢, and 
If we divide this last equation by &¢** we find cw: +cor 
+¢w,=0 (YW). This implies - - - =c,=0. Repeating this proc- 
ess with the other homogeneous components on the left-hand side of (8.1) 
yields the conclusion that (8.1) implies -- - =P,=0. 

Conversely, if w* is an element of 0* which is homogeneous of degree s, 
then w*/E** =w is in 09 and a and bin 
and i=1, 2,---, v. If & is an integer so large that Ef 
and are all in 0* then we have 


= 
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i=1 

We select elements \#*, A#*, - - - , A* in o* as follows: take A* =1, and let 
As* be an element in o* of lowest degree of homogeneity, 72, such that A¢* does 
not satisfy a congruence of the form (A*), where r1:=0 
is the degree of A#*. We let A# be a homogeneous element of lowest degree 
in o*, say rs, such that A;* does not satisfy a congruence of the form 
+ (A*), K, for any integer k. We must 
have f2Srs3, for any element w* in o* of degree h<rz satisfies a congruence of 
the form =c,E¢*-" AF (A*). In general, if - - - , A* have been 
selected, and they are of degrees 7; (=0), 1SreSrsS - - - Sr; respectively, 
then we let A*,; be an element of smallest degree of homogeneity, ri41, which 
does not satisfy a congruence of the form 


for any integer k. We of course have 74127. 

This process yields exactly functions As*, - - - , in o*. This state- 
ment follows directly from the fact that 00/% is of rank v over K, in view of 
our considerations on the congruences (8.1) and (8.2). Moreover, if we put 
Ac then Ay, Ao, - - -, A, are linearly independent modulo Y, for if 
then for a sufficiently large value of k we have 
+--+ ,(A*). The choice of excludes the possibility of such 
a congruence. In a similar manner we deduce that if w* is an element of 
o* which is homogeneous of degree h, and if, say, r; is the last element of the 
sequence 7; S725 --- Sr, such that h2r; then w* satisfies a congruence of 


the form 


where & is an integer. Since h2r;, 7 =1, 2, - - - , 4, we can write 


(8.4) G cata?) = 0(*), 


and since &* has no projectively irrelevant components we have 
(8.5) w* — = AEM + BEF 
j=l 
where A and B are in o*. 
We can now easily prove the following theorem. 


THEOREM 8.2. The elements - - , forma modular base for o* over 
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the ring R*. In fact, if w* is a homogeneous element of degree h in 0*, and r; is 
the last integer r; such that h2=r;, then w* =P,A\*+ - - - where P; is in R* 
and is homogeneous of degree h—r;. 


Proof. If h=0 the assertion is trivially true. Assume, therefore, that the 
theorem is true for elements in 0* which are homogeneous of degree less than 
h, and let w* be of degree h. We note that the left-hand side of (8.5) is homo- 
geneous of degree h, so that we may assume that the elements A and B on 
the right-hand side of (8.5) are homogeneous of degree h—1. (This follows 
by the usual application of the mapping 7: K.) By induction, 
then, we have A=) where A; and B; are homo- 
geneous of degree h—rj;—1, j=1, - - - , 4. Hence by (8.5), where 
+B,&*. Since there are at least » elements in 0* which 
are linearly independent over R*, the elements A;#*, As*, - - - , A,* are neces- 
sarily independent over R*, q.e.d. 
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GENERALIZATION OF A PROBABILITY LIMIT 
THEOREM OF CRAMER 


BY 
W. FELLER 


1. Introduction. Let {X,}, #=1, 2,---, be mutually independent ran- 
dom variables, that is to say independent real functions in a space in which a 
probability measure is defined. Let = Prob be the distribution 
function of X;, and suppose that the second moment 


exists; without loss of generality we may then assume that 


(1.2) f = 0. 


Let 
(1.3) +X, 
and 


Finally, let F(x) = Prob { Six} be the distribution function of S,. The 
independence of the X, implies that 


+0 
(1.5) F,(x) = V;(x), = F, (x — y)dVn4i(y). 


The central limit theorem asserts that, under very general conditions, 
(1.6) F,(x5,) = f 


uniformly in every finite interval. If, however, with 2—>© also x it fol- 
lows only that F,(xs,)—>1, but it is not even clear whether 1 — F(xs,) is of the 
same order of magnitude as 1— (x). In the Bernoulli case with 


0 for z<—p<0, 
Vilx) @ V(x) =(1-—p for 
: for 2, 
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the behavior of F,(xs,) when x=x(n)—>© has been the object of detailed 
investigations by A. Khintchine [5]('), N. Smirnoff [7], P. Lévy [6] and 
M. Fréchet [4]. A more general case has recently been treated by H. Cramér 
[2]; he considers the case of “equal components” where all V;(x) are identical, 
Vi(x) = V(x), and it is moreover supposed that the integral 


(1.7) 


exists for all |a| <a9#0. Nothing seems to be known in the case of unequal 
components. The problem as such has been stated by P. Lévy [6, p. 289]. 

The purpose of the present paper is to generalize Cramér’s result to the 
case of unequal components. This generalization seems of interest in itself, 
but it is also to serve as the basis for the establishment of the general form 
of the so-called law of the iterated logarithm. 

We shall consider only the case where | X,,| has an upper bound of the form 
o(s,). Since s,— ©, this restriction is not very strong. In some cases it is possi- 
ble to free oneself from this restriction, provided the “tails” [\2) >, Vn(x) 
are sufficiently small. However, in many applications the principle of equiva- 
lent sequences of random variables will automatically reduce the considera- 
tion to the case of bounded variables. Our proof is based on a straightforward 
adaptation of Cramér’s proof, although we shall avoid the use of character- 
istic functions. Cramér’s proof, in turn, rests on a transformation first used by 
F. Esscher [3] in connection with an insurance problem. In the special case 
of equal components our condition | X,| =0(s,) is more restrictive than 
Cramér’s condition that the integral (1.7) should exist. On the other hand, 
our results will be slightly sharper than Cramér’s, since it will be possible 
for us to free ourselves of the logarithmic term in the remainder without in- 
troducing a new hypothesis. 

2. Results. Throughout what follows we shall suppose that the {x »} are 
subject to the conditions (1.1) and (1.2). Moreover, we shall suppose that 
(cf. (1.4)) 


(2.1) | Xz] < fork=1,---,m 


In most applications the {X;,} will form an infinite sequence, and \,—0. 
Condition (2.1) can then be replaced by | X,| <AnSn, provided that the se- 
quence A, is non-increasing. It should be noted, however, that Theorem 1 is 
strictly a theorem on a finite sequence of random variables. 

In §§3—5 we shall prove 


THEOREM 1. If 


(2.2) 0 < < (3 — 51/*)/4 


(*) The numbers in brackets refer to the bibliography at the end of the paper. 
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then 
(2.3) 1 — Fy(asq) = {1 — &(x)} + 
Here 6 has an upper bound independent of the sequence {Xx k land of n; furthermore 


(2.4) Qn(2) = ¥ 


where the coefficient q,,, depends only on the v first moments of Vi(x), - - -, Vun(x). 
If, more particularly, 


(2.5) 0 < < 1/12 
then 

(2.6) jal <9 

and 

(2.7) | < (1/7)(12An)”. 


Moreover, for any 0<i<j Sn we have 


(2.8) | Ox) — (1/2)(sj — 


Before describing the construction of g,,, in general we proceed to de- 
scribe a few implications of Theorem 1. First let us observe that by applying 
the theorem to the sequence { —X;}, we obtain a corresponding theorem for 
negative x. Since however Q,(x) depends also on moments of odd order the 
behavior of F,(xs,) for x—>— © will, in general, be different. 

Since 


(2.9) 1 — B(x) = 0/9274 — 
with 0<# <1 we can write (2.3) in the form 


which shows the order of magnitude of the remainder as compared with that 
of the main term. 

To appreciate the meaning of the theorem consider the representative 
case of a uniformly bounded sequence { X. x} with s2~n. Then A,=O(n-"/). 
If, then, x =0(n™*) it is seen that Q,(x)—0 and A,x, =0(n—*); thus 1 — F,(xs,) 
will be of the order of magnitude of 1— (x). However, if x is of an order of 
magnitude between n”* and n/4, the order of magnitude of 1— F,(xs,) will 
be that of {1— D(x) for an x between and n*/!* the term 
(1/2)gn,sx* will be significant also, whereas and so on. Still 


= o(1) for x = o(n'/%), 
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so that for x=o(n*) at least log (1—F,(xs,))~log (1—¢(x)). Now if 
x~n"?, the behavior is quite different. Then (x?/2)Q,(x)~const. x, and 
log (1—F(xs,)) will be asymptotically equivalent to (a/2)x? with a1; in 
other words log (1—F,(xs,)) will no longer be asymptotically equivalent to 
log (1—@(x)). A similar change in the asymptotic behavior of 1 — F,(xs,) can 
be observed in the general case and is of fundamental importance in connec- 
tion with the law of the iterated logarithm. 

For use in the same connection we note also the following simple corol- 
lary, to Theorem 1. 


THEOREM 2. Let a>O be an arbitrary constant and let max (2, a)<x 
<1/(100X,) and A,<1/100. Then there exist two constanis 0<a,<a:<1, de- 


pending only on a, such that 
1 — F((x + 
2.11 < < a2. 
( ) 1 — F(xs,) 


Proof. The theorem is an easy consequence of (2.10). The term in brackets 
lies obviously between 1/2 and 3/2. From (2.7) we conclude furthermore that 


3a(v + 2) 24 
= 


100 /’ 


so thatas | —x7(1 +0,(x))| S3a. 
Choosing, say, = 3e~*/? and a,;=(1/5)e-*¢/? the theorem follows. 
Theorem 2 is essentially equivalent to the following more elegant theorem 
which has been discovered by Khintchine [5] in the case of the binomial and 
generalized by Cramér [2] to the case of equal components. 


THEOREM 3. If x—> ©, x=0(1/K,), then 


1—F,((% + a/x)s,) a/zx) 
lim = lim = 


The proof is similar to the preceding one. 
We proceed, finally, to describe the construction of the function Q,(x). 
Let(?) 


(2.12) bi = 


and define quantities y:,, (Thiele’s semi-invariants of V:(x)) by 


+e h 
(2.13) f avs) = 
vm? 


! 


la (?) Note that here as in the following the integral actually extends only over the interval 
x| <AnSa. 
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(we have in particular 


(2.14) 


while Yx,:=0 by (1.2)). We put 


(2. 15) Yk,» 


Then 


2 2 


where h=h(x) is defined as the inverse function of 
(2.17) 


For the first two coefficients we obtain 


1 n 
D 


2.18 = 


1 n 1 n 2 1 n 2 
2.19 2,2 = — — — — bis}. 


3. A transformation. Following Esscher and Cramér we shall introduce a 
transformation which will reduce { Vi(x)} to a new sequence of distribution 
functions { Vi(x)} for which the central limit theorem will become applica- 
ble. For this purpose we introduce a real parameter h >0 which will later on 
be specialized as a function of x and n. 

We put 


+00 
(3.1) pr = f e**dV 


and 
1 z 
(3.2) V.(x) = red 


Then V;(x) is again a distribution function. Moreover, V;(x) obviously varies 
only within a finite interval. Hence the moments 


(3.3) my = 
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and 
2 2 2 
(3.4) -f (x — mx) dV;(x) -f x dV — Mr 
exist. In a manner analogous to (1.4) and (1.5) we define 
(3.5) My = +n, 
(3.6) 


and 


F(x) V(x), 
— 
-f F,(% — y)dVn4i(y). 


(3.7) 


Now by (3.2) 
1 1 
(3.8) f = —f dV.(y) =— V(x). 
Pid Pe 
Hence it is seen that 


holds for n=1; using (1.5) and (3.7) it is seen that (3.9) holds generally. 
Since F,(x) is a distribution function we have, letting x, 


(3.10) 1 — F,(x) = 


Now it is our purpose to apply the central limit theorem to F,(x). It will 
therefore be convenient to write (3.10) in the form 


(3.11) 1—Fya(x) = f dF + 


2—M,)/in 
This formula will be used for x = M7, only. Introducing the quantity 
(3.12) P, = log pe 
kel 
we deduce from (3.11) finally 


(3.13) 1 — F,(M,) = f + 


[November 
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4. Some estimates. Operating for the moment in a purely formal manner 


we have by (2.13) and (3.1) 
h’ 
(4. 1) log Pe = 


A simple computation shows that (cf. (3.3) and (3.4)) 


(4.2) 


(4.3) 


(4.4) 


(4.5) 


(4.6) 


We now proceed to find majorating power series for these expressions. It 
follows from (2.1) and (2.12) that fork=1,---,m 


(4.7) | Be,» | S 
Now by (1.2) 


(4.8) 
v= 2 


v 


We shall actually use only values 0<X.s,4<1/5, so that p, remains near 1. 
It follows from (4.1) and (4.7) that 


y! 


= — log 41 — 1 — si), 
where the sign < indicates that the power series on the left is majorated by 
that on the right. Now write for the moment 


| 
d log px 2 
. ini. = = 
_2 d? log Pe 

Using (2.16), (3.5), and (3.6) we obtain 

h’ 

y=? y! 

he 
T's,» 
(vy — 1)! 
2 

5, = Ts,» * 

(v — 2)! 

2 
< lo 1 
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(4. 10) log {1 - = 
v 


so that p2=1, ps=1, ps=4, ps=11, p, 20. Differentiating (4.10) twice we ob- 
tain 
xe* + 1 = 
(2+ 2-—e7)? (»—2)! 


Substituting here x = 1/2 we obtain easily that 


(1/2)"-* < 3/10, 


whence 
(4.12) 0 S p,/(v — 2)! S (1/2)-2" for »y = 3. 
Comparing (4.9) with (4.10) and noting that of <A3s2 we deduce readily that 


(4.13) | S = 3, 
or by (4.12) 

(4.14) (1/(» — 2)! | ve»| S 

Accordingly we have by (2.15) 

(4.15) (1/(» — 2)!)| S 3, 


whilst I’,,2=s2; this allows us to deduce majorating expressions for P,, M, 
and 8 (cf. (4.4)-(4.6)). 

5. Application of the central limit theorem. Using (4.14) and (4.2) it fol- 
lows that 


2 1 — 2 1 — WnSah 
observing that I’,,2=s2 it follows similarly from (4.6) that 
2 AnSnhk 


2 2 
Now, by definition, Vi(x) (e=1,---, ) varies only within the interval 
| x| <A,Sa. Accordingly, if we restrict \,5,4 to an interval 0<A),s,4<c<1/2, 
V(x) will not vary outside of an interval centered at m, and of length O(,5,). 
It follows (A. C. Berry [1, Theorem 4]) that (cf. (1.6)) 


| + x3.) — (x)| = O(,). 


More precisely let us suppose that, say, 


2 1 AnSnh 22 1 AnSnh 
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0 < 1/10. 


| #/st- 1) <1/8, and | < 


Therefore for k=1,---,m 


f dV. (x + = e**dV 
| 21> (6/5) rnin 


k© | (6/5)Anon 


1 


< 
Pi | 


e**dVi(x) = 0. 


Aasutdinale, using Berry’s theorem, we have 

(5.4) + = + 
with 

(5.5) | Ra(x) | < (13/3)rn. 
Introducing (5.4) into (3.13) we obtain 


Now, using (5.5) we obtain 


R,(0) + fe | s (26/3)rn 
0 


(which result can be derived also by means of the second mean value theo- 
rem). The last inequality together with (5.6) shows that 


(5.7) 1 — F,(M,) = { @(1/2) — + Orn}, 
where 
(5.8) | < 26/3. 


We now have to determine hk so that 


he 


We write the inverse series formally, 


Sa 255 kml 


XSn- 


Using Cauchy’s principle for majorating series and the estimate (4.15) it is 


1943] ee 369 | 
(5.3) | 
Then | 
| 
| 
| 
| | 
- 


370 W. FELLER : [November 


readily seen that the formal inversion of (5.9) is possible at least for all values 
of x for which the inversion of 


(5.11) x = S,h — 


is possible. The substitution 

(5.12) = 4, = 2 
reduces (5.11) to 

i # 


i-s 


the inverse function 
1+ 2u —| 1 — 6u + 4u?|1/2 

is regular in the interval (2.2). Using an inductive argument we readily de- 
duce from the series-expansion in (5.13) that 

20 
du’ u=0 


for all Hence c* 20; we have in particular = 1/2 and c# =3/2. For u=1/6 
we obtain z=1/5 and hence 


(5.14) mat 


= 1/80; 


accordingly we have for »22 
0S (1/12)6"". 

Returning to the original variables x and h it follows that for y=2 
(5.15) | | S 
and 

12 
We note in particular that the restriction of x to the interval (2.5) implies that 
(5.17) AnSnk S 13/144 < 1/10 


so that (5.3) holds; this justifies the numerical estimate of §4. 
For u=(3—5?)/4 we obtain from (5.14) z=(1/2)(1—5-"/?) <1/2. Hence 
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to values of x in the interval (2.2) there correspond values of 4<1/(2AnSn); 
by (4.15) these values are within the interval of convergence of the power 
series 

| 


(5.18) — f., hr. 
vm? v! 


Hence we can write 


2 2 2 2 @ 
(5.19) kM, — P, = = +—D 


end 


where the last series converges at least within (2.2). Now define a new power 
series 


(5.20) = 


by substituting 
x 


into 


x? 


It has been shown that the series (5.21) and (5.22) are dominating series 
for s,k and for hM,—P,, respectively. Hence (5.22) dominates the series 
(5.19) so that |gn,»| Sg%,.. However, an easy computation shows that 
1+0,*(1/12A,) <1.26 and that g* 1 =5A,/3; it follows that g%,,<(1/7)(12A,)’, 
which proves that Q,(x) has the properties stated in Theorem 1. 

Now by (5.7), (5.9) and (5.19) 


(5.23) = { @(1/2) — (h5,)) + Orn}. 


It remains to show that here 453, can be replaced by x = M,/s,. Comparing 
the power series for h3, and M,/s, it is seen that the terms of first and second 
order in h are the same; hence 45, =x+0O(x*d2); more precisely we have 


(5.24) | — | < 2.45d,2%, 
provided only that (2.5) holds. Now let for the moment 
(5.25) f(t) = — 
Then 


| 
| 
| 
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= — (2x)? + — BM}, 
and a well known asymptotic expansion shows that 
0s — S 
Accordingly by (5.24) 

2.45 
— 2.6d2 2%)? 
Thus we have by (5.23) and (5.25) 

1 — F,(xs,) = e~(1/2)2* { + Or. + Pru}, 


where <d,x and | 6;| <26/3 (cf. (5.8)). 
This proves the main part of Theorem 1. It remains only, to: prove (2.8). 
Now it follows from (2.15) and (2.16) that 


s 


| — f(x) | < 


h’ 7 


(V — 2) 


Accordingly, using (4.14), 


kmi+1 
Ps 1 
=—(s;—s 


This implies (2.8) since s,4/x<13/12 and \,5,4<1/10 by (5.16) and (5.17), 
respectively. 


= 7 
x*| Ox) — Q(x)| on 
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THE GENERAL FORM OF THE SO-CALLED 
LAW OF THE ITERATED LOGARITHM 


BY 
W. FELLER 


1. Introduction. The purpose of this paper is to sharpen Kolmogoroff's 
celebrated law of the iterated logarithm in various directions and to give 
best results. For the convenience of the reader, we shall link the statement of 
our problems with an account of the known results. 

1.1. It seems that the law of the iterated logarithm traces its origin to a 
special problem in number theory. From the viewpoint of the theory of proba- 
bility, as we shall see, this problem concerns only a very special case in which 
the general features are hardly visible. Nevertheless, this is the only case in 
which so far a final result has been achieved. 

Let p be a point of the interval (0, 1) and let p=pipeps - - - be its binary 
expansion. (We shall be concerned with statements of the “almost every- 
where” type, so ambiguity will do no harm.) Let 


+1 if p= 1, 
—1 if = 0. 


Then S,=Xi+ - - - +X, is the excess frequency of occurrence of the digit 1 
among the first » places in the expansion of p. The strong law of large num- 
bers (Borel, Cantelli) asserts that almost everywhere S,=0(n). The follow- 
ing enumeration of sharper results indicates the historic development of the 
problem. It is, of course, understood that all assertions hold true only_for 
almost all points p. 

(i) Hausdorff, [6], 1913: 


= { 


S, = e> 0. 
(ii) Hardy-Littlewood, [4], 1914: 
S, = log n)*/*). 
(iii) Steinhaus, [16], 1922: 
lim sup log 1. 


(iv) Khintchine, [7], 1923: 
= O((m log log n)*/). 
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(v) Khintchine, [8], 1924: 
lim sup S,,/(2n log log n)"/? = 1. 


In formulating newer results we shall, following P. Lévy, say that a function 
¢(t) belongs to the upper class (CV) if, for almost all », there exist only 
finitely many m such that S, >/%(m); and we shall say that ¢(#) belongs to 
the lower class(') (€.C) if, for almost all the inequality S,>n/*o(m) is 
satisfied for infinitely many m. Khintchine’s result (v) then reads: 


EV if a>il, 
a(2 log log #)*/? 
EL if a<1. 
(vi) P. Lévy, [11], 1933: 
EV if 


2 log log ¢ + a log log log #)'/? 
(2 log log ¢ + a log log log #) EL if «<1 


(with a gap for 1<a33). 
(vii) Kolmogoroff-Erdés(?), (1937-1942) : If $(é) is non-decreasing, then (*) 


EV EC 


The last result contains, of course, the preceding ones. It gives, in a most 
elegant way, a complete solution for the binomial case with probabilities 1/2. 
It is our purpose to find a similar solution for arbitrary independent random 
variables, that is to say, for arbitrary bounded real functions X; in arbitrary 
spaces. As a matter of fact, the above solution is so specialized that it fails 
already in the next simplest problem concerning the frequency of occurrence 
of digits in the ordinary decimal expansion. 

1.2. For a general formulation of the problem consider an infinite sequence 
of spaces {E,}, k=1, 2, - - -, in which probability-measures are defined(‘). 
Let E=Ei:XE.X be their infinite combinatorial product 


(?) According to the law of 0 or 1 each ¢(#) belongs to one of these classes. We shall not use 
this result in the sequel. 

(*) Kolmogoroff’s result has been stated, without proof, in P. Lévy’s book [12]. Erdés [2] 
proved the result completely. Before, J. Ville [17] had proved that convergence of the integral 
is a sufficient condition for the upper class. It is the necessity of this condition which presents 
the real difficulties. Most of the results of the present paper have been obtained before publica- 
tion of Erdés’ paper. 

(*) Here and in the following @ and D stand for “converges” and “diverges,” respectively. 

(*) In each E, there is a Borel family of sets § such that Ex€ Fz; on Fz we have a com- 
pletely additive set-function Pr(I) with OS Pr(T)S1 and Pr(E,)=1. 
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and let a probability-measure in E be defined in the customary manner(‘). 
Let X,=Xi(fx) be a real-valued function of a point p, varying in E,; then(*) 
(1.1) Sou +f, 

is a function of the point p=: Xp2X --- CE. Let 

(1.2) Vi(x) = Pr x}, ow, 


be the distribution function of X;. Then the distribution function F,(x) of S, 
is found from 


+00 
f Fa(z — 
We shall suppose that 


exists; replacing, if necessary, X; by Xz+c;, we can without loss of generality 
assume that 


(1.4) = 0, k=1,2,---. 


The second moment of F,(x) is 


(®) We shall actually consider only sets of the type T= XIy,X +++ 
XTy,X +++, where I'y,€ and the dots stand for Z,. Of course, Pr(T)= - Pr(Ty,). 
If and A=(Aj,, , Aj,,) are two such sets, they will be called independent, 
if 4, ~j.; then Pr(T'A)= Pr(T)Pr(A). For our purposes it is sufficient to have a measure-theory 
for finite combinatorial products and the corresponding notion of sets of measure zero in the 
infinite product space. 

(®) In this formulation it becomes clear that, as far as mathematics is concerned, we are 
dealing with a problem in real function theory in product spaces. The sense, and importance, 
of our investigation for probability and statistics is, it is hoped, obvious. The subscript & stands, 
of course, for the order number of the experiment or observation. Ex, the &th label space, is the 
abstract expression for the set of all thinkable results of the kth experiment, each result being 
represented by a point p,rG Ezy. (In practice these results will be: head-or-tail; the position of the 
roulette; the position of a particle subjected to diffusion or, more generally, the phase-space in 
physics; or the complex composition with respect to age, claims, risk, and so on, of the stock of 
policies of an insurance company.) X, will be the gain or any other characteristic in the kth 
experiment and we are interested in the fluctuations of the accumulated value of that charac- 
teristic after » experiments. The epistemological problem of what “probability” means has no 
more to do with theory or practice than the corresponding problem of space conception in 
geometry. 

It may be remarked that we do not attach any importance to the possibility that the Ey 
are not similar to each other. The fact of the situation is that the structure of E, simply nowhere 


appears. 


| 
| 
| 
| 
| 
| 
| 
| 
| 
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(1.5) 


For obvious reasons only the case where 
(1.6) Sn © 


presents a problem. 
We shall, following P. Lévy, say that a sequence { on} belongs to the lower 
class (ELL) (with respect to {Xx}), if, for almost all pCE, there exist infinitely 


many n such that 
(1.7) Sa > Saba; 
and to the upper class (CV) tf, for almost all pCE, there exist only finitely 
many n such that (1.7) holds. (It will be noticed that Theorem 8 actually con- 


tains a sharpening of the notion of upper class: in defining. the upper class 


we could as well replace (1.7) by (2.19).) 
1.3. The law of the iterated logarithm states: if each X;, is bounded and 


(1.8) Lu.b. | = o(s,/log*! Sn) 


uniformly in k=1, 2,---, m, then 

EUV if a>2, 

EL if a<2. 

This is the most general result known; it was proved by A. Kolmogoroff 
[10], after important special cases had been solved by A. Khintchine [9]. 
A new proof of (1.9) under more restrictive conditions than (1.8) has been 


given by P. Lévy [12]. Under much stronger conditions Cantelli [1] has 
proved a result which is sharper than (1.9): suppose that for some ¢>2 


+00 
f | x k=1,2,---, 


(1.9) (@ logs s,)*/? 


exists, and that uniformly in k 


+00 +e 
f x*dVi(x) 2 ao, f | x|*dVi(x) 
then (even not assuming (1.8)) 
EV if ¢>3, 
EL if 
the gap between 1 and 3 is left open. (Cf. above P. Lévy’s result vi.) 


There is a most surprising result due to Marcinkiewicz and Zygmund [15] 
which is important for the proper understanding of our problem. They showed 


(2 loge Sa + @ logs s,)'/? 
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that if (1.8) be replaced by the only slightly less restrictive condition: 


(1.10) Lu.b. < 


(€ arbitrary but fixed), then (1.9) becomes false. In fact, they constructed a 
sequence { X;} satisfying (1.10) but for which 


(1.11) lim sup (S,/5,(2 logs s,)*/*) < 1. 


1.4. In the present paper we shall give necessary and sufficient conditions 
for a sequence {¢,} to belong to the upper or lower class. It will be seen that 
our criterion assumes the simple form of Kolmogoroff’s criterion (vii) if the 
{ are subject to the condition(’) | =0O(s,/¢3) (Theorems 2 and 6). 
As this condition is gradually relaxed the criterion assumes an increasingly 
complicated form. Thus, if | Xz | =0O(s,/¢2) the criterion will involve third 
moments of the distribution functions V(x); this implies a remarkable lack 
of symmetry, since now { on} may belong to the upper class for the sequence 
{X,} and to the lower class for {—Xz} (Theorem 3). If it is assumed only 
that | X,| =O(s,/¢%*) the fourth moments of { Vi(x)} will be of influence also, 
and so on (Theorem 5). Finally, when the upper bounds of | X,| are of the 
order C(s,/,) all moments of { V(x) } enter into consideration; when C is 
very small, the dominant influence will still be exercised by the second mo- 
ments; however, as C increases, the dominant role gradually shifts to moments 
of an increasingly higher order. It is thus seen that if the upper bounds of 
|X| increase faster than s,/¢,, we shall be confronted with an altogether 
new situation and any criterion would necessarily be of a quite different 
form(*). This domain is not investigated in the present paper. The criterion 
given in Theorem 1 still covers the case (1.10) (with e=1/200) and thus ex- 
plains the phenomenon detected by Marcinkiewicz and Zygmund by giving 
the precise limit between the upper and the lower class. 

It will be seen that the present paper considers only individually 
bounded variables X,. It is hardly necessary to point out that most sequences 
which occur in standard applications can be reduced to this case by the cus- 
tomary method of equivalent sequences (truncating the X; at values M; such 
that Pr{|X;| converges). This method makes it evident, for ex- 
ample, that Kolmogoroff's law of the iterated logarithm is applicable if 


(") For a proper understanding of the theorems (as well as for a comparison with Kol- 
mogoroff’s condition (1.8)) it should be noticed that we shall be interested usually in sequences 
{¢n} near the borderline between the upper and the lower class. That is to say, roughly speak- 
ing ¢» will increase like (2 log log sa)*/*. 

(®) This observation is also borne out by results of P. Lévy [13] and Marcinkiewicz [15], 
who investigated the analogue to the law of the iterated logarithm in the special case where 
cx7* S1—V;,(x) Cx-*, with 0<a<1 and two constants C>c>0. 
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V,(x) = V(x) where V(x) has a finite moment of order 2+¢>2(*). Actually 
it is possible to extend also the exact criteria of the present paper to very large 
classes of sequences {X;,} of unbounded variables. An example of such a 
class (containing most usual sequences) is given in the appendix. In the 
construction it seemed desirable to aim at greater simplicity rather than at 


greatest possible generality. 

Our proofs are direct and do not presuppose previous work on the subject. 
Beyond standard tools of measure theory we shall use only a theorem de- 
scribing the behavior of F,(x) for large x; this theorem is a sharpening of the 
classical central limit theorem of probability and has been proved in a previ- 
ous paper("*). 

2. Theorems. Throughout the sequel X;, will stand for a real function in E, 
(see §1.2); S, ts defined by (1.1) and Vi(x) by (1.2); tt shall be assumed that 
(1.3), (1.4), and (1.6) hold. Moreover we shall suppose that there exists a se- 
quence d,, | 0 such that(*) 


(2.1) Lu.b. | Xa| < Ansa» 
where s,, is defined by (1.5). Of the sequence { on} we shall assume that(**) 
(2.2) 


2.1. For the most general case of our criterion we shall require a function 
Q,(x) which regulates the behavior of F,(xs,) for large x. Define, in a formal 
way, quantities 7,,, and by 


+00 
log f = 
vm? v 


and 


(*) This seems to have escaped the attention of Hartman and Wintner [5], in their strong 
remarks about the inapplicability of Kolmogoroff’s law to sequences occurring in standard 
applications. Actually it suffices to suppose that fe] log| x| [14g V(x)=A is finite for some 
e>0. Then >. 5100/1 log log V(x) <A log sx)*/k log*** kconverges; the truncated 
variables obey the law of the iterated logarithm and the variances of their sums are asymptoti- 
cally equivalent to s* . Hartman and Wintner [5] have devised a slightly more efficient method 
of truncation which shows that it is sufficient to suppose that JPtd V(x) exists. It does not fol- 
low that in this case also our criteria hold. 

(*) Feller [3], in the sequel referred to as L.T. The present paper should be readable with- 
out knowledge of L.T. 

The monotony of {An} is used exclusively to assure that l.u.b. | <AnSa for 
k=1,2,+++,m. 

(2) It seems only natural to assume monotony of {¢,}. The restriction is unessential and 
the proofs become even slightly simpler if, instead, it is assumed that the variation of {¢.} 
in any interval is small as compared with 1/¢.. However, abrupt changes of ¢, are interesting 
and considered in the sequel. If {¢.} remains bounded, the theorems become trivial. The as- 
sumption ¢, >2 is convenient and presents no loss of generality. 


LAW OF THE ITERATED LOGARITHM 
= Vk 
kel 
(Ye,1=0 by (1.4); only small values of & will be considered and it is under- 


stood that the +z, are real). Then 


2 1 
{1+0.(2)} = —— 
2 


where 4 =h(x) is the inverse function of 


Sn pon? — 1)! 


It has been shown in L.T. that, if xA, <1/12, Q,(x) is an analytic function 


(2.3) = 


and 
(2.4) | | < (1/7)(12An)’. 
In general, gn,» depends on the first y-+2 moments of Vi(x), -- +, Va(x), 


+00 
(2.5) bk.» = f 1, + 2. 


It follows from the main theorem of L.T. that, under the conditions of the 
present paper, the ratio of F(sa¢,) tog,’ {exp ]} remains 
between two positive constants. This explains the importance of the function 
Q(x) for the present investigation. 

2.2. In the following theorems the main conditions ((2.6), (2.8), (2.12), 
and so on) depend on the sequence { ¢.} which is to be tested..Although these 
conditions seem most natural, it is sometimes more advantageous to replace 
them by conditions which do not explicitly depend on {¢,}. Such alterna- 
tive conditions will be formulated in Theorem 11. 


THEOREM 1. if 
(2.6) An 1/2006, 
then CU(L) if, and only if(*), 


(2.7) = $n exp {— (1/2)42[1 + On(on)]} € C(O). 


(#) Cf. footnote 3. 


2 
| 


380 W. FELLER . [November 


Postponing the proof of the theorem we mention a few corollaries. 


THEOREM 2. Jf 
(2.8) = 
then CU(L) if, and only if, 
2 


In fact, using (2.4) it is seen that (2.8) implies ¢20,(¢,) =O(1) and Theo- 
rem 2 follows from Theorem 1. Similarly one proves another corollary: 
THEOREM 3. If 
(2.10) An = O(1/n) 
then {bn} CU(L) if, and only if, 


on 2 
n k=l 
where b;y,3 is defined by (2.5). 
In the case of symmetric variables X, the moments };,3 vanish. Therefore: 
THEorEM 4. If 1— Vi(x) = Vi(—x) and (2.10) holds, then CU(L) if, 
and only if, the series (2.9) converges (diverges). 
In general we have in the same way the following theorem. 


THEOREM 5. If, for some integer p>1, 
(e+2) /p. 


(2.12) An = O(1/bn 
then {bn} CU(L) if, and only if, 

on 2 3 p+1 
(2. 13) 3 exp { (1/2) [on + + + Qn,p—19 }} C(D); 
thus the behavior depends only on the first p+-1 moments of { Vi(x)}. We have in 
particular 


1 n 
>> 


388 


1 n 1 ® 1 n 2 
= —— — — — —( bua). 
4s§\ 


12s* 


Qn,1 = 


In §7 it will be proved that the criterion of Theorem 2 can be reformulated 
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in a way analogous to Kolmogoroff’s criterion in the binomial case as follows. 
THEOREM 6. If and 

(2.14) dn = 

then {$(s2)} éf, and only if, 


The connection between our theorems and the law of the iterated log- 
arithm becomes more apparent in the following theorem. 


THEOREM 7. If either 
(2.16) Ae = O(1/logs $4), 
or Vi(—x) =1—Vi(x) and 
(2.17) An = O(1/logs Sn), 
then 
>, = {2 logs sa + 3 logs sa + 2 logs sa + cee 
(2.18) + 2 log,1 sa + (2 + 8) log, VUL) 
if, and only if, 
5 > 0(< 0). 


This theorem is actually a simple corollary to Theorems 2 and 4. For the 
proof it suffices to note that, with the definition (2.18), ¢, = (2 logs s,)/?+-0(1), 
so that (2.16) and (2.17) are equivalent to (2.8) and (2.10), respectively. 
Thus the criterion (2.9) is applicable. However, 


+ o(1)) 


S* log s? logs s2 - - - logi*¥/? 5? 


now Ye diverges, and therefore(™) the above series converges if, and only 
if, 5>0. 

_ We shall prove the following sharpening of the notion of lower class, which 
holds true in all cases considered. in. this paper. 


(*) This follows easily from theorems of Abel-Dini and of Cesaro. See, for example, 
K. Knopp, Theory and applications of infinite series, London-Glasgow, Blackie and Son, 
1928, pp. 292 ff. Theorem 7 is, of course, an immediate consequence also of Theorem 6. Special- 
ized to the symmetric binomial case it becomes a consequence of the theorem of Kolmogoroff- 
Erdés. It should be noted that the terms s,, in (2.18) can be replaced by Sa. 
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THEOREM 8. Under the condition (2.6) of Theorem 1, if {dn} CL then for 
any couple of constants a<B and at almost all points, the inequality 


(2. 19) Sn(On + a/ dn) < 5S. < Sn(On + B/¢n) 
will be satisfied for infinitely many n. 
Hence: If {dn} EL also {on+a/dn} 


Besides the criterion of Theorem 1 we shall prove the following (less 
handy) criterion: 


THEOREM 9. Under the condition (2.6) of Theorem 1, the sequence 
{¢.} GUL) if, and only if, for any sequence of integers {n,} for which 
(2.20) + < < + 
where a and b are constants, the series 
(2.21) Pr > CM). 

We mention also the following corollary to Theorem 1, which follows easily 
by means of (2.4). 

THEOREM 10. Under the condition (2.6) of Theorem 1 if 


AY 


the sequence 

(2.23) {on + Anda} VU; 
if (2.22) diverges, 

(2.24) {Gn — Anda} 


Finally, we give an alternative form for the conditions of the preceding 
theorems. Its usefulness will be exemplified by the results of the appendix. 


THEOREM 11. The criterion of Theorem 1 remains valid if the condition (2.6) 
ts replaced by 


(2.25) An S 1/200(log log s,)*/*. 


A similar remark applies to Theorems 2-6. If (2.25) is satisfied, the sequence 
{¢.} belongs to the same class as the sequence {p,} defined by 


(2.26) = min {ony 2(log log Sa) 4/2}, 


For a generalization of the preceding results to the case of unbounded 
variables {X,} the reader is referred to the appendix. 
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2.3. Remarks. The following remarks are either obvious or easily verified. 
They will not be used in the sequel, and their detailed proof can therefore be 
left to the reader. 

1. Theorem 8 contains a best result in two directions. The assertion that 
{¢.} and {¢,+const/¢,} belong to the same class cannot be improved. First, 
if {X,} is a given sequence and a,—>~, it is possible to find a sequence such 
that {¢,} EL but {¢.+2./¢.} EU (cf. Erdés [2] in the case of the binomial 
distribution with equal probabilities). More interesting is that to any given 
{pn} and a, it is possible to find a sequence {X,} such that {¢,} EL 
but {¢n+an/dn} GV. The fact that {¢,} and {¢,+a/¢,} belong to the same 
class is easily established independently of the previous results. In fact, it 
is sufficient to notice that | (¢,+4/$n)*Qn(n+2/n) —G2Qn(n)| =O(1), and 
this is readily proved using (2.4). 

2. If An =O(1/¢3) then {on} is of the same class with respect to the se- 
quence {X,} and {—X,}. This is a best result. If a, is an arbitrary se- 
quence it is possible to find a sequence {X,} with \,=O(a,/¢3) such that 
{on} EU for {X,} but {¢.} EL for {—X,}. 

3. It has been conjectured by P. Lévy [12, p. 266] that Kolmogoroff’s 
criterion (Theorem 6) is applicable for any uniformly bounded sequence 
{X,}. He put the problem of finding the weakest conditions on {A,} under 
which the theorem would hold. Theorem 6 gives the dest result. If \,¢3— ©, 
it is possible to find a sequence {X,} such that the criterion of Theorems 2 
and: 6 becomes false. 

4. It will be noticed that the condition (2.16) is sharper than Kolmo- 
goroff’s condition (1.8). However, (2.16) is the best condition. If \, log?” s,— © 
it is possible to construct a sequence {X,} such that (2.18) does not hold. 

5. It is interesting to compare Theorem 10 with the classical law of the 
iterated logarithm, and with our theorems. If \, =O(1/¢3) then the sequences 
{bn tugs} obviously belong to the same class, and Theorem 10 reduces to 
Theorem 1. However, as the order of magnitude of {Xa} increases, Th<o- 
rem 10 becomes less and less sharp. With the law of the iterated logarithm 
one knows only that A, =0(1/¢,); hence the assertion relates only to sequences 
of the form {¢,+¢¢,}. When A,=O(1/¢.) even this breaks down, which 
explains the behavior of the counter-example constructed by Marcinkiewicz 
and Zygmund. 

6. It is hardly necessary to point out that the constant 1/200 in Theo- 
rem 1 (2.6) is convenient but purely arbitrary. Actually 1/12 would have 
been sufficient (except for minor complications). There is no best constant, 
as the radius of convergence of Q,(x) depends on the sequence { X,}. 

3. The sequence {n,}. We proceed first to establish the existence of se- 
quences of the kind occurring in Theorem 9: If condition (2.6) is satisfied then 
to any constants a, b with 


| 
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(3.1) 0 < a < (99/100)8, b > 1/100, 


there exists a sequence {n,} such that (2.20) holds. Obviously m—. 
The proof is very simple. Since, by assumption, s,—> © we can choose m 
arbitrarily and define a sequence {m} by the recurrence relations 


(3.2) & Say(1 + 
(3.3) > Say(1 + 
Then by (1.5), (2.1), (2.6), and (2.2) 


2 2 2 2 —4 —2 2 


2 —4 —2 2 


or, using (3.3) and (3.1), 


sa,(1 + 


In the following { ny} shall stand for an arbitrary, but fixed, sequence satisfy- 
ing (2.20). We shall write 


(3.4) Sap =Sé, Sng = Sk, = Oh, = (x), Ang = 


This will greatly simplify the notation, as we shall require a subsequence 
of {”,} and, in turn, a subsequence of it. 


4. Lemma 1. Suppose that condition (2.6) is satisfied and that 

(4.1) > Pr {Si > sioé} EO. 
Let e>0 be fixed. Then at almost all points the inequality 
(4.2) < Sn < + 3€/on) 

will be satisfied for infinitely many n. 

Before proceeding with the proof let us remark that, putting 
(4.3) pr = exp {— + ]}, 
condition (4.1) is equivalent to 
(4.4) EOD. 

Indeed, by Theorem 1 of L.T. 
Pr {Si > < const. pi. 
Moreover, by Theorem 2 of that paper, to any e>0 there correspond two con- 
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stants 0<a;<as, depending only on e¢, such that 
(4.5) arpr < Pr {sige < Si < si (oi + < 
Accordingly, (4.1) is equivalent to the statement that 
(4.6) Pr < Si < si (od + EOD. 
4.1. For the proof of Lemma 1 we require a sequence {k,} defined by 
the recurrence relations 
where M>1 is a constant to be determined later; #1 can be chosen arbitrarily. 
It is readily seen that 
(4.8) (2/(2 + < S 
and 
(4.9) — S (M’/a) 
In fact, using (4.7), (2.20), (3.4), and (2.2) it follows that 


< S Stell + S (1 + 5/4) 
which obviously implies (4.8). In the same way 


1 ‘TL a 1 a \ 
1 a 


which proves (4.9). 
4.2. For any i<j let us define 


= 


(4. 10) Sig = —-Si= > Xn 


Then 
(4.11) Fi) = Pr 2} 


is the distribution function of Sj,;, and obviously: 
(4.12) f «dF ;,;(x) = 0 


and 
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+00 


Now denote by A,,m (m2k,) the set 
(4.14) = E{smlom + €/Om) < Si, < + 


where ¢ is an arbitrary, but fixed, number with 
(4.15) O0<e<l. 
We propose now to show: if 
(4.16) ky Sm < 
(4.17) Pig < 
and if 
(4.18) (2 + b)?/M? < ¢/4, 
then there exists a constant n>O (depending only on €) such that 
(4.19) Pr (Ay,m) > 1Pm 
for all v sufficiently large. 
Proof. Since the sequence s/ is not decreasing and m>k, we obtain, using 
(4.16), (4.8), (4.17), and (4.18), 
= Sm — Sm{1 — Sm {1 — (2 +6) /(4M } 
whence 
(4.20) — S S 
Accordingly, putting 
(4.21) Arm = Ef Si, + 3¢/(26m)) S + 
it is readily seen that 
(4.22) 
To evaluate Pr (A¥,,) we can apply Theorems 1 and 2 of. L.T. Indeed, Sj, _,:m 


is a sum of independent random variables X;; moreover, it is easily seen 
(using (4.20), (4.15), (2.2) and (2.6)) that 


Lu.b. | Xz| < (1/100) 5x, 
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so that Theorem 2 is applicable. It follows immediately that 


Pr (Aym) 2 Pr (Anm) 


4.3. Now put 


(4.23) 


m=ky 


and 


(4.24) P,= bm 


m=ky 


We shall show: If (4.17) and (4.19) hold, then there exists a constant n1>0 such 
that 


(4.25) Pr (A,) > mP,, 
for all v sufficiently large. 
Proof. Let N be a fixed integer and consider the sets 


(4.26) Ave = Ara — A A 


p=m+iN 


Obviously 


kyt+1—1 
(4.27) A,= > 


maky 


On the other hand, it follows from (4.26) that the intersection of two sets 
A. and A, will be empty unless | m,—m,| <N. Any point of A, is there- 
fore contained in at most N distinct sets A® figuring in (4.27) so that 


(4.28) Pr (4) 2 > Dd Pr 


maky 


Again it follows from the definition (4.14) of A,,, that for any point of 
A, (o>m) 


(4.29) Sine > + €/6,) — + 2€/bm) > (Sp — Sm) bp — Sm(2€/bm)- 
In other words 

(4.30) Ay,mAr» C 

where 


(4.31) = E{Smp > (5) — Sm) — 
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However, the sets A,,» and B,,, are independent (see the footnote 5 on p. 375), 
so that 
(4.32) P(Av,mBm.p) = P(Av,m)P(Bm,p)- 

Comparing (4.26), (4.28), (4.30), (4.32), and (4.19) we see that 


ky+1—1 


p=m+N 


p=m+N 
n kyt+1—1 ky+1—1 
N m=ky p=m+N 


We turn to the evaluation of Pr(B,.,,). For that purpose we write the inequal- 
ity in (4.31) in the form 


(4.34) 

where 

(4.35) 

To evaluate the probability of (4.34) we may again apply Theorem 1 of L.T., 
provided that all the X,, with »<, have an upper bound not exceeding, say, 
(1/200¢, #) Now for n<n, 


| X,| 


1 
200¢/ 


and by (4.35) 
te Sm) _ 


¢, , 


Theorem 1 of .T. is therefore applicable and 


Pr = Pr {Sans > 


4.36 1 “1 
(4.36) < mz, { — (1/2)¢p [1 + }} 


provided, say, ¢*>2. Using the estimates (2.4) for the coefficients of Q* it is 
readily seen that 


(4.37) Pr (Bmp) 95", 


1 ky+1—1 
Pr(4,)2— Pr} Ann 
N m=ky 
<1. 
+ 


1943] LAW OF THE ITERATED LOGARITHM 


Now by (4.35) 
Sp — Sm Sm 


(st? — s/2)12 (si? — ra 


* 
> = 


2e 


1 


However, by means of (2.20), (3.4), (4.16) and (4.17) we deduce that for 


a a(p — m) 
4.39 = 


This inequality will be applied in three ways. 
(i) Since p2m-+N it follows that 


using (4.17) it is therefore seen that the last term of the right-hand member 
in (4.38) is small if N is large. Hence (4.38) implies, for N sufficiently large, 


/2 


(4.40) $, = (1/2)(1— sn/s,) 
(ii) Consider now p with 

(4.41) a(p — m) S on. 

Combining (4.39)-(4.41) we obtain 


/10. 


1 1 gil? gil? 
(4.42) $2 mat To (p — m)*!?, 


1+ a(p — m)/(49"2) 10 

Comparing (4.37) with (4.42) it is readily seen that, for N sufficiently large, 

min { ky+1—1, m+ 1 
(4.43) Pr (Bmp) S  —, 

p=m+N p=m+N 3 
(iii) Finally, consider p with 

(4.44) a(p — m) = bm 
For these we obtain from (4.39) s/?/s/?25/4 and from (4.40) 


(4.45) = (1/10)(¢, — 
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since k,<p<k,, and since ¢,/-+~, it follows from (4.45) and (4.17) that 
for v sufficiently large 


(4.46) = (1/30) 
Combining (4.37) and (4.46) shows that 


ky+1—1 ky+1—1 


However, (4.9) implies that the last expression tends to zero as vy. Ac- 
cordingly, for v sufficiently large, 


kyt+1—1 1 
(4.48) Pr (Bu) < 
p=m+[¢3/a) 3 
Substituting for Pr (B,,,,) from (4.43) and (4.48) in (4.33) we obtain 


n kv+1—1 n 
(4) 2 oy 
which proves the assertion (4.25). 
4.4. We propose now to show that almost all points belong to infinitely 
many A, or, in other words, that 


(4.49) Pr( > A,) =1, 


for any ». For that purpose we note that the set A, depends only on the vari- 
ables X, with m,_, Sn" <mz,,, (cf. (4.14) and (4.23)). Accordingly, the sets As, 
are mutually independent, and 


(4.50) 4.) = Pr Pr(A2)), 


and a similar relationship holds for odd subscripts. Now it would be trivial to 
deduce (4.49) from (4.50), if we could use (4.25). However, the last inequality 
has been proved only under the assumption that (4.17) holds. We shall show 
that the v’s for which (4.17) does not hold are of no influence. 

Let(#*) {»(r)}, r=1, - - +, denote those integers for which 
(4.51) Pine S 
We shall prove that 


(4.52) Dd Pur EC. 


(*) For simplicity in print the notation »(r) has been introduced instead of the usual »,. 
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In fact, it follows readily from (2.6), (2.4) and (4.3) that peS(1/od Je2*r, 
and therefore 


(4.53) be 
for k sufficiently large. We find by means of (4.8) and (2.20) 


ky+1—1 ky+i—1 
1 a 1 


Thus by (4.53) and (2.2) 


ket1—1 1 


P, = Pm 
m=k, 
or » 


M? 

a@ 
but (4.51) implies that the sequence {¢:,,,,} increases so rapidly that the 
series (4.54) converges. Accordingly the series >> ,«»)P, diverges, and without 
loss of generality we may suppose that )>s,«,:)Ps, diverges. It follows from 
the definition (4.51) that for y¥v(r), (4.17) holds. Accordingly, for these »v the 
inequality (4.25) holds, and therefore 


Pr( > Ay) = 1 I] Pr (4) 


(4.55) 
21- [J (t1-mP,) =1. 


4.5. The proof of Lemma 1 is now easily accomplished. We know that 
almost all points belong to infinitely many sets Az, with 2v#v(r), that is to 
say, such that 


Denote then by C,, the set 
(4. 57) Cy = (1/8) < < (1/8) 41 } 


where 5>0 is arbitrary. At any point of the intersection A2,,C2, we have 
simultaneously the inequality figuring in (4.57) and 


(4.58) + €/bm) < < Su(bm + 
Adding these inequalities we find that in As,mC2, 


(4.59) + Sme/dm — < Sm < + 2€ Sm/bm + 


72 
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However, for ks, S m <Rari1 we have, using (4.8) and (4.56), 


thus, if the constant M is chosen so that 
(4.60) (2+6)/M <6 
we obtain from (4,59) that at any point of As,_C2, 
(4.61) < Sm < Salm + 3€/bm)- 
Denote the set of points for which (4.61) holds by An. Then An DAs,,mC2>, 
and therefore (always for 2v¥v(r)) 
kay+i—1 
Pr( = Pr (A2,C») 


m=key 


or 


(4.62) Pr( An) 2 Pr( > ACs). 


mek, v2), 


Again, it follows from the definition (4.57) that (Tschebycheff’s inequality) 


Pr (C») = f 4 f 
_, 


On the other hand, the sets Az, and Cy, are independent if p2v (cf. footnote 
5 on p. 375). We have therefore 


Pr( > Axl») = > > Pr 


(4.63) 


(rv) 


Pr A» > Ay) 


Pr (Cx) Pr > A) 


= (1 — 8) Pr( Ay) = (1-8) 
v2), 


(by (4.55)). This proves that all points, with the possible exception of a set 
of measure 6, belong to infinitely many sets A,,; at such points the inequality 
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(4.61) is satisfied for infinitely many values m. Since 6 is arbitrary, and the 
definition of A,, is independent of 6, it follows that at almost all points the 
inequality (4.61) will be satisfied for infinitely many m. This proves Lemma 1. _ 


5. Lemma 2. Suppose that condition (2.6) is satisfied and that 


(5.1) dX Pr {Si > sist} EC. 
Then . 
(5.2) {on} EL. 
As before (cf. §4) we see that condition (5.1) is equivalent to 
(5.3) 
where ?; is defined by (4.3). We shall require three sets: 
(S.4) A,= E{Sn > Saba}, 
(5.5) Bae = Sum > — <n 
(5.6) = Ef Su > — (2 + 


(for the definition of S,.. and Sa. cf. (4.10) and (4.13)). It is sufficient to 
prove that 


(5.7) 4.) 
n=N 

for N sufficiently large. 

Again we deduce from (2.20) for m1<nSm 
on the other hand by definition and by (5.8) 

= Sax — Sn 

Adding the inequalities in (5.4) and (5.5) and using again (5.8) we see there- 
fore that at all points of the intersection A,B,,. 
Sa, = Sa + > bbn/ 2b 

= + [On — [1 — — [6 + 

> — (2 + 

provided ¢2, ,>b. Comparing (5.6) with (5.9) it is seen that 
(5.10) Ce D <n Sm. 


(5.9) 


| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 


394 W. FELLER q [November 


Furthermore, using Tschebycheff’s inequality (cf. (4.63)), we obtain 
(5.11) Pr (Bax) = 3/4. 


Now it should be noticed that the sets B,,, and A,, are independent if m Sn. 
Therefore 


Pr (Ci) Pr ( > 


nk n 


= >> Pr AnBm.s} 


"hk n 


n=n,—1+1 


= > Pr(Baa) Pr (4, A. > An) 


n=—np—1+1 m=np—y+1 


n=np—1+1 


or 


(5.12) Pr( > An) < Pr (C,). 


However, Theorem 2 of L.T. asserts that Pr(Cz) < m2pe_-1, where 72 is a constant 
independent of k. Accordingly 


Pr( > A.) Pr( A.) S—m Do 
n=ny k=N—1 3 k=N-1 
which proves (5.7). 


6. Lemma 3. Suppose that condition (2.6) is satisfied, and that the series 
(2.7) diverges (converges). Then 


k On, 
diverges (converges) for any sequence {nx} satisfying (2.20). 


REMARK. It was pointed out earlier (cf. the introductory remarks to 
§§4 and 5) that (6.1) converges or diverges simultaneously with 


(6.2). Pr {Sa, > 


k 
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Thus Lemmas 1 and 2 together are equivalent to Theorem 9 with the sharper 
form for upper functions provided by Theorem 8. The three lemmas together 
prove Theorem 1 and Theorem 8. From the previous lemmas it follows that 
the behavior of (6.1) is necessarily the same for all sequences {m,} satisfying 
(2.20). However, it should be noted that the following proof is absolutely 
independent of the previously established results. 

Proof. As before, let {m,} stand for an arbitrary but fixed sequence satis- 
fying (2.20). Since o2 = 0(s?), we shall have, for sufficiently large(*) 


(6.3) = log = — log (1 — o,/S.) = 


and hence 


= és 2 2 2 2 
(6.4) S log log (1 + S 
n=nptl 


and similarly 


(6.5) (1/2) log (1 + 0/63,) 


Suppose that the series (2.7) diverges, and define the sequence {m,.} by 


om, exp {— (1/2)bm,[1 + 
= max exp {- (1/2)oa[1 + Qn(¢n)]}. 


(6.6) 


Then 
(6.7) ne < S 


and by (6.4) 
2 


2 
Cn 


n 3? k 
(6.8) 


< ms 67-6112) ding (1+ (Oma), 


k Gi, 


Unfortunately, the ratio ¢n,/dn, is not necessarily bounded. Still, we shall 
show that those k, for which ¢n,/¢n, is not near unity, can be discarded. For 
that purpose, we make the following simple remark, which will be useful in 
the sequel: If for a sequence(*) k(1)<k(2)<k(3)< --- 


(6.9) +1 > Par + 2/ 


then 


(*) The argument leading from (6.3) to (6.4) and (6.5) is due to Cesaro; cf. Knopp, loc. 
cit. p. 292. 
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(6.10) 


r 


converges. In fact, the terms of (6.10) decrease geometrically since (1 /4) Pixos 
= (1/4) Pig +1. Now it follows from (2.6) and (6.7) that, at least for & suffi- 
ciently large, 


so that by (6.10) 


Again, if kxk(1), we have 
(6. 13) Pm, s Par < 
Using (6.12) and (6.13) we have 


bed} 
therefore it follows from (6.8) that 

(6.14) 
~ 


diverges. 

It will be noticed that the sequence {ms} does not necessarily satisfy the 
condition (2.20). We shall show that the divergence of (6.14) implies that of 
(6.1). It suffices obviously to show that if k=k(1), k(2), --- 


Pm, On, 


where C is a constant. Now ¢n,2¢n,- As for the exponential terms we have 
| + — + | S ma — + | |) 


Now | Q,(¢.)| <1/10 by (2.6) and (2.4). Moreover, since kk(1), k(2), ++ +, 
we have (cf. (6.9)) 


Thus the first term in (6.16) is less than 6. 
Next, using Theorem 1, formula (2.8), of L.T. and (2.20) we find 


(6. 16) 
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so that also the second term in (6.16) is bounded. To appraise the last term we 
note that the derivative of Q,(x) is of the order of magnitude of \,. Thus, 
using the mean-value theorem we obtain by means of (2.6), (2.4), and (6.17) 


om, | Om,(m,) Om,(bn,) | < (dmg < 1, 


which proves (6.15) and, accordingly, the divergence of (6.1). 

The same argument applies in the case of convergence. 

7. Proof of Theorem 6. By virtue of Theorem 2 it suffices to prove: If 
(é) is a non-decreasing function, (t)>2, then the integral (2.15) converges if, 
and only tf, 


2 
(7.1) 
$ 


converges. Since (1/t)p(t)e~“/# is nonincreasing we have 


t n=k+1 t 
n 2, —(1/2 
> > /2) 
n=k+1 
so that the convergence of (2.15) implies the convergence of (7.1). Similarly, 
if (2.15) diverges so does 


2 


(7.2) 


n—l 


We have to show that the divergence of (7.2) implies that of (7.1). First let 
us observe that 
= Sanat on < (1/10')s;, 
or 
2 2 
(7.3) Sn—1 > (1/2) Sn. 


Furthermore, let », denote those integers for which 


(7.4) > O(Sy-1) + 


Since this implies that $*(s?,) >¢*(s0) +2k, which, together with 
(7.3), shows that 
2 
(1/2) (63-1) Ee. 


| 
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Thus, by assumption, the partial sum in (7.2) extended over n #1, v2, - - - 
diverges; however, for these n 
2. 


< 2€ 


which completes the proof. 
8. Proof of Theorem 11. Writing, for abbreviation, 


(8.1) nn = 2(log log s,)'/? 
we have from (2.4) that 
(nn/2) {1 + Qu(nn) } 


so that 
2 


(log log a)” 
log*/? s,, 
As }>°o2 diverges, it is seen that the right-hand member of (8.2) is conver- 


gent(!7). Now, by (2.25), the sequence {nn} satisfies the condition of Theo- 


rem 1. Hence {7,} EU. 
The convergence of (8.2) implies that the two series 


(8.3) exp {— (1/2)¢.[1 + Qn(¢n) }} 


and 


(8.4) > exp {— (1/2)¥a[1 + ]} 


either both converge or both diverge: as a matter of fact, those terms in 
(8.3) which are different from the corresponding terms in (8.4) form a sub- 
series of the left-hand member in (8.2). 

On the other hand, if { Vn} EU, then also {on} EU, since ¥, $¢,. Again, 
and accordingly, if the inequality S,>s,. is satisfied for 
infinitely many 2, this will be the case for such for which yy, =¢,. Hence, if 
{vn} EL, also EL. 


APPENDIX 


The purpose of this appendix is to show how the previous results can be 
extended, in many cases, to sequences of unbounded variables X,. The main 
application is, of course, to the case where all X, have the same distribution 


(27) Cf. Knopp, loc. cit. 
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function (cf. the corollary at the end). As a typical example we consider only 
Theorem 2. The method applies also to the remaining theorems. 

From now on the variables { X,} are no longer necessarily bounded; how- 
ever, we still assume that the second moments (1.3) are finite and that (1.4) 
holds. 


THEOREM. Suppose that for some constant M>0 


(1) = (log log f x*dVi(x) 


k z|>M (log log 


Then CU(L) if, and only if, 


(Needless to say the criterion of Theorem 6 also holds.) 


Proof. It follows from the last two paragraphs of §8 that it is sufficient 
to prove the theorem for sequences {¢,} such that 


(3) on 2(log log s,)*/?. 


f xdV 9(x) 
| (log log 


Xn— a, if | S M sa/(log log s,)*/2, 
if | X,| > M s,/(log log s,)*/2, 
if | X, | S M s,/(log log s,)*/?, 
if | X,| > M s,/(log log s,)*/?. 
Obviously 
(5) X, = Xn +X. + ap. 
Now 
Per (xX. => dV,,(x) 
n n | 2|>M (log log 
1 > (log log s,,)* 


x*dV,(x) < 


so that at almost all points 


(6) | = O(1) = 
Next, by (4), 


| 
2 

| 
| 
Put | 

| 

| 

| 

| 

| 

{ 

| 

| 

| 

| 

| 
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(7) | a, | S Ms,/(log log s,)*/?, 
and therefore by definition 
(8) | < 2Ms,/(log log s,)*/?. 


The variance of X#* is given by 


*2 
= 


(x — dVi(x) + on f dV (x) 


| (log log 


(9) = x dVi(x) — as 


Now it follows from (1.4) and (1) that 
(log log s,)*/? * ™ (log log s,)*/? 


n k=1 


f | «| dV s(x) 
Sk | z|>M (log log 


1 2 (log log f 
Mien | 21>M (log log 
= O(1). 


x(x) 


Accordingly 


(10) = o(— 


kal (log log s,)*/? 
and, by (7), 


(log log s,) 


Similarly 


f 
| (log log s,)*/? 


2 3 
Sa (log lo 
(log log ss) x*dV 


s 
(log log | 2|>M (log log 
2 


Combining this with (3), (9), and (11) we have 


[November 


1943] LAW OF THE ITERATED LOGARITHM 


2 2 
#2 2 Sn 2 Sn 
12 = = Sn, O = S, O 
This implies that the two sequences { on} and { (sn/s*)bn} belong to the same 
class with respect to {X,*}. Now, if {¢.} €.C, at almost all points the in- 
equality 


(13) = p> XE > + C(Sn/Sabn) } 
1 


will be satisfied for any C>0 for infinitely many m so that {¢,.} EL also 
with respect to {X,}. A similar remark applies if {¢.} GU. In view of (5), 
(9) the last inequality implies that also S,>s,@, for infinitely many n. This 
finishes the proof. 


COROLLARY. The criteria of Theorems 2 and 6 are valid if there exist two 
constants 6>0 and A such that 


+00 
(14) f x | log | x| < Aor. 


In particular, this is the case if Vi(x) = V(x) and 


+00 
(15) f x?| log | x| < @. 


In fact, if (14) holds, the series (1) is (at least for sufficiently large k) 


majorated by 
on 


ch 


s? logi+#/2 5, 
which series obviously converges. 
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REGULARITY OF LABEL-SEQUENCES UNDER 
CONFIGURATION TRANSFORMATIONS 


BY 
T. N. E. GREVILLE(*) 


1. Introduction. It is the purpose of this paper to develop a set of trans- 
formations to be applied to a label-sequence, or a set of such sequences, 
yielding a single, new sequence, and to show that certain essential properties 
of the sequences are almost always preserved under such transformations. 
This result, like other less general theorems previously given by other writers, 
provides a new kind of justification for the use of the classical rules for the 
combination of probabilities. These transformations are defined by setting 
up certain correspondences between configurations of elements in the original 
sequences and single elements in the sequence resulting from the transforma- 
tion. The set of transformations considered will be shown to include as special 
cases the four fundamental operations in terms of which the rules for the com- 
bination of probabilities have been formulated analytically by von Mises(?), 
as well as the operations used by Copeland, Reichenbach, and Popper in 
defining classes of canonical sequences. 

The concept of the configuration transformation was first suggested by a 
consideration of the problem of “sets of games.” A limited application of the 
principle to certain simple cases was made in an earlier paper in which it 
was stated, on the authority of von Mises himself, that his methods were not 
applicable to the formulation of such problems(*). He has since pointed out(‘), 
however, that the general “set-of-games” problem can be handled by means 
of a suitable combination of the operations of selection, mixing, and com- 
bination. A more general class of problems can be dealt with through the 
transformations of this paper; and, in any case, it is convenient to work with 
a single type of operation which includes all the others as special cases. More- 
over, it simplifies the treatment of complicated problems to express them in 


Presented to the Society, September 8, 1942, and also April 7, 1939 under the title Invars- 
ance of the admisssbility of variates under certain general types of transformations; received by the 
editors January 29, 1943, and, in revised form, March 31, 1943. 

(?) A dissertation submitted in partial fulfillment of the requirements for the degree of 
doctor of philosophy in the University of Michigan. The writer wishes to express his apprecia- 
tion to Professor A. H. Copeland for his valuable assistance in the preparation of this paper, 
and to Professor R. von Mises for permitting him to read the manuscript of an unpublished 
paper [3]. Numbers in brackets refer to literature cited at end of paper. 

(*) von Mises [1, pp. 75-94]. 

(*) Greville [1, p. 410]; von Mises [1, pp. 108-109]. 

(*) von Mises [2, pp. 195-197; 3]. 
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terms of a single operator rather than a large (and frequently infinite) num- 
ber. 

My results as to the invariance of certain properties of sequences under 
configuration transformations include as special cases most of the theorems 
previously proved on the existence of the collective. For the methods used in 
arriving at these results, I am much indebted to Kolmogoroff, Feller, and 
Doob(°). 

2. The configuration transformation. Let [m, 72, --- ] be a set (finite or 
denumerable) of arbitrary sets, each containing more than one element, which 
will be called label-spaces. Let x=[x:, x2, - ++ ] be a set of infinite label- 
sequences, x;=x{”, x”, - - - , each sequence x; being composed of elements 
of the set 7;. 

A transformation on the set x will be defined by means of a configuration 
function(*) f(c), in which c denotes a finite configuration of elements of the 
product-space =71Xm2X ---. It is assumed that f(c) is defined for some, 
but not necessarily all, finite configurations of elements of 7. The “values” 
of f(c) may be any arbitrary elements. 

From the sequence of configurations: , in which is the con- 
figuraticn formed by the first r elements of all the sequences x;, select the 
subsequence ¢,,, ¢,,, * * + consisting of those configurations c, for which f(c,) 
is defined. Finally, form the sequence y= 7(x) =f(c,,), f(r), . This de- 
fines the configuration transformation T. The label-space p, consisting of all 
possible “values” of f(c), will be assumed to contain more than one element. 

As an example of a configuration transformation, consider two players 
simultaneously throwing dice, who agree that A is to win a bet if he throws 
a one before B throws a six. Otherwise, B wins; but if both throw their num- 
bers.on the same toss, it is not counted. Here, 7; and 73 each consist of the 
integers 1 to 6, and 7 consists of all the possible pairs of such integers, while p 
consists of the two labels A and B. The configurations for which the function 
JS(©) is defined are those finite sequences of pairs of integers 1 to 6 terminating 
with a pair whose first member is 1 or whose second member is 6, but not 
both. f(c) has the value A in the former case, B in the latter. 

The four fundamental operations of von Mises, place selection, partition, 
mixing, and combination, are all particular types of configuration transforma- 
tions. The first three are all operations on a single sequence. A place selection 
can be defined(7) by means of a configuration function g(c) defined for all 
finite configurations, and restricted to the values 1 and 0. A sequence x’ is 
selected from the sequence x by adopting the rule that any element x‘® is 


(*) Kolmogoroff [1], Feller [1], Doob [1]. 

(®) Configuration functions have previously been used by Wald [1], Doob [1], Feller [1], 
and Ville [1], as a means of defining position selections. 

(*) Wald [1, pp. 38-39]; Doob [1, pp. 364-365]; Feller [1, p. 89]; Ville [1, pp. 41-42]. 
See also von Mises [1, p. 75]. 
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selected if and only if g(x, x, - - - , x(*-) =1. This operation can be re- 
garded as a configuration transformation by merely including the element 
selected in the configuration, considering the label-spaces w and p as identical. 

The operation of mixing(*) consists in grouping together certain sets of 
labels and agreeing to represent all the members of a given group or set by 
a single label. The operation of partition consists(*) in selecting from a se- 
quence x a subsequence composed of all those elements belonging to some 
subset y of x. The operation of combination(*) consists in combining a set 
(finite or denumerable) of sequences [x:, x2, --- ] into a single sequence 
whose elements are points in a multi-dimensional space having as coordi- 
nates the corresponding elements of the individual sequences. All these opera- 
tions are clearly particular cases of the configuration transformation. 

3. A measure theory. Every set of label-sequences x = [x:, x2, - + - ] may 
be regarded as a point of the infinite product-space I =x XX - - - . In order 
to facilitate the study of the behavior of sequences under configuration trans- 
formations, a measure theory on II will be developed. It will be assumed that 
there exists a field , consisting of subsets of and including 7 itself, and a 
non-negative, additive set function p(y), defined on ®, such that p(7) =1. 
It will be assumed further that p(y) is absolutely continuous: that is, if there 
is a monotone sequence 71)2_ ~ -.- approaching the null set as limit, then 


howe = 0. 


According to a well known extension theorem of Banach("'), a set function 
p(y) having the above properties can be extended into an absolutely additive 
set function defined on a Borel field ®’(!*). The function so obtained will be 
called the Borel extension(*) of the function (7). 

We shall follow Kolmogoroff in calling the set 
- +--+, where each is an element of 2, a cylinder-set('*) on the space II; 
and we define P(g)=p(A1)p(As) - - (An). Lomnicki and Ulam(**) have 
shown that a set function thus defined can be extended into an absolutely 
additive (and therefore absolutely continuous) measure function defined on a 
Borel field F consisting of subsets of II. Obviously, P(II) =1. 

If the sequences x1, x2, +--+ are independent, it is possible to begin by 


(*) von Mises [1, p. 76]. 

(*) von Mises [1, p. 86]. 

(*) von Mises [1, p. 94]. 

(") See Kolmogoroff [1, pp. 15-16], Feller [1, p. 90]. 

(2) A Borel field is one having the property that every set which is the sum of a denumer- 
able infinity of sets of the field belongs to the field. See Kolmogoroff [1, pp. 15-16]. 

(4) Kolmogoroff [1, p. 25] uses the term cylinder-set in a somewhat broader sense. We shall 
therefore designate as a simple cylinder-set a set of the simple product type used here. See also 
Feller [1, p. 90]. 

(*) Lomnicki and Ulam [1, p. 252]. Kolmogoroff [1, p. 27] has also proved this theorem 
for the case of a Euclidean space. See also Doob [1]. 


406 T. N. E. GREVILLE ° [November 


assuming the existence on each space 7; of a set function ;(y;), defined on a 
field ®;, having the required properties. A unique measure function p(y) on 
a field @ is then obtained by the same process used in developing the func- 
tion P(g). p(y) automatically possesses the necessary properties. 

In order for this measure on II to be of use in the study of configuration 
transformations, it is necessary to limit consideration to configuration func- 
tions f(c) which are measurable. Consider a finite product-set s=\i XAs 
xX on the m-dimensional product-space pXpX Xp. Let E, 
denote the point set on II consisting of sets of sequences x= [x:, x2, - + | 
such that the configuration formed by the first » elements of the correspond- 
ing sequence y belongs to s. Further, let Z,, denote the subset of Z, consisting 
of those points x for which the configuration ¢c associated with the element 
y™ (through the relation f(c)=y) contains m elements. If there exists a 
field VY, consisting of subsets of p including p itself, and such that, for every 
such set s formed from elements of V and for every m, the corresponding set 
Eum belongs to the field F, the configuration function f(c) is said to be meas- 
urable on V, and the transformation T is called a measurable transformation. 
In the case of a selection, this definition can be shown to be equivalent to 
that given by Feller("*). 

4. The regularity theorem. In studying the behavior of sequences under 
configuration transformations, we are interested in the frequency with which 
certain sets of elements appear in the transformed sequence. We shall de- 
velop a measure function on the transformed label-space p, based on the struc- 
ture of the transformation 7, which will indicate what frequencies are to be 
expected for these sets on an a priori basis. The measure theory already de- 
veloped on II will make it possible to determine whether our expectations are 
satisfactorily met. In general we shall consider this to be the case if the set of 
untransformed sequence-sets not producing the proper frequencies in the 
transformed sequence is of measure zero on II. In order to secure a suitable 
measure function on the transformed space, it is necessary to impose certain 
further (although as shown later, not too severe) limitations on the trans- 
formations which may be considered. 

A measurable configuration transformation T will be called endometric if 
there exists an additive set function g(5), defined for every set 5 of Y, such 
that g(p) <1, and having the property that, for every finite product-set 
XA, formed from sets of ¥, 


(1) P(E.) & gn). 
The function g(5) will be called the expected frequency of the set 5. Let 
(5) We shall always use the ordinary multiplication sign to indicate a multi-dimensional 


product-set and the dot to indicate the set-product or common part of a group of sets. 
(*) Feller [1, p. 90]. 
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6,(5, y) denote the number of elements, out of the first » elements of y, which 
belong to 6. 


THEOREM 1. If T is an endometric configuration transformation, and 6 is 
a set of the field V, then, for almost every point x of Il, either the sequence y = T(x) 
terminates after a finite number of elements, or else(*") 

(2) lim y) = 9(6). 

Let M,,, denote the set of points of II for which the sequence y= T(x) 
contains at least m elements and exactly & of the first » elements belong to 6. 
Let 7 denote the set p— 6. Then, since T is endometric, g(p) =q(6) +q¢(n) $1. 
Hence g(n) $1—g(4). It follows from (1) that 


P(Ma,x) Ca,x[q(6)]*[1 — 


By a well known property of the binomial expansion("*), the measure of the 
set 


Mayr 


n=N |k/n—q(8)|>¢ 


approaches zero as WN tends to infinity, for every positive «. Therefore the 
point set on II for which the upper or the lower limit of N~'@,(5, y) differs 
from g(6) by more than e¢ has the measure zero. 

This theorem leaves something to be desired, since it implies that a trans- 
formed sequence terminating after a finite number of elements is quite as 
satisfactory as an infinite sequence having the expected frequency for the 
set 5. By imposing certain additional requirements on T, this defect can be 
eliminated. A measurable transformation T will be called complete if, for al- 
most every point x of II, the sequence y=7(x) contains an infinite number 
of elements. A transformation will be called holometric if it is both endometric 
and complete. A point x on II such that the sequence y = T(x) does not termi- 
nate, and such that (2) holds for every set 5 of Y will be called a regular point 
under T. A set function f(a) defined on a field A will be called denumerably 
ordered if there exists a denumerable set of sets 0, (k=1, 2, - - - ), all con- 
tained in A, such that for every set a of A(*), 


(3) f(a) = G.L.B. f(bs) = L.U.B. f(b). 


The following theorem follows immediately from Theorem 1 and from the 
above definitions: 


(*7) This theorem was proved by Feller [1, pp. 89-93] for the case in which T is a selection. 

(**) Borel [1, p. 4] Feller [1, p. 93]. 

(*) This condition was used by Feller [1, p. 89]. G. L. B. denotes “greatest lower bound”; 
L. U. B., “least upper bound.” 
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THEOREM 2. If a configuration transformation T is holometric with respect 
to a function q(5) which is denumerably ordered on a field V, then almost every 
point of II is regular under T. 


5. Iteration of configuration transformations. It follows from the defini- 
tion of a configuration transformation that if y= 7(x) and z=T7’(y), T’ being 
one-dimensional, then there exists a configuration transformation 7’’ such 
that z=T’’(x). Therefore, any finite sequence of configuration transforma- 
tions in which all but the first are one-dimensional can be replaced by a single 
configuration transformation. 

In order to discuss the behavior of y under T’, it is necessary to define a 
satisfactory measure function on the label-space p. The requirement that T 
be endometric is not quite adequate, as it does not impose sufficient limita- 
tion on g(6) to permit its use as a measure function. For this reason, another 
special type of configuration transformation will now be defined. An en- 
dometric transformation will be called ésometric if, for every product-set 
s=hiX\2X XA, formed from sets of ¥, 


It will be shown later that every holometric transformation is isometric 
(see Theorem 4). Therefore, in the case of a holometric transformation, the 
function g(6) meets all the requirements for a satisfactory measure function, 
and can therefore be extended, as indicated in §4, into an absolutely additive 
set function defined on a Borel field V. It is then possible to define a measure 
function Q(d) on a field G consisting of subsets of the infinite product space 

Returning to the discussion of iterated transformations, it will now be 
shown that if T is holometric and 7” is measurable, then T’’ is measurable. 
Let o denote the label-space associated with the sequence z, and let 
S=1XpueX - ++ Xu, denote a finite product-set on the n-dimensional prod- 
uct-space oXo@X --- Xo. Further, let Hg denote the set of points y on R 
such that the configuration formed by the first # elements of z belongs to S; 
and let Hs= denote the subset of Hs consisting of those points y for which 
the configuration ¢ associated with the element 2‘ contains m elements. 
Since T’ is measurable, there exists a field 2 composed of subsets of o, such 
that for every m and for every product-set S formed from elements of Z, 
Hs» is measurable on R. Now, there corresponds to every sequence y the set 
of sequence-sets x such that 7(x) =y. Similarly, there corresponds to every 
set of points y on R a set of points x on II. If K denotes any point set on R 
and L the corresponding point set on II, we shall presently show that L is 
measurable provided K is measurable, and moreover P(Z) = Q(K). But if the 
set Hs» be taken as the set K, the set L is the set Zs» associated with the 
configuration S under the transformation T’’. This not only shows that T’’ 
is measurable if 7’ is measurable, but also that if T’ is endometric, complete, 
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holometric, or isometric, then T’’ has the same property. 

To prove the above statement regarding corresponding point sets on the 
spaces R and I], first consider the case in which K is a simple cylinder-set 
+ KAnXpXpX each A; being an element of VW’. In this 
case, it follows from (4) that P(L) =Q(K), since 7, being holometric, is there- 
fore isometric (see Theorem 4). Now, the measure function Q(d) on the en- 
tire field G was secured by extension from its definition for simple cylinder- 
sets. Moreover, Lomnicki and Ulam have shown that this extension is 
unique(?°), As the two measure functions P(g) and Q(d) have the same 
laws of combination, the same equality must hold for all measurable sets. 

A sequence of configuration transformations applied in succession to a 
sequence-set x, so that we have T,{ - - - 7:[71(x)]} =z, will be called a chain. 
It follows, of course, that all the transformations of the chain, with the pos- 
sible exception of 7), are one-dimensional. The following theorem about 
chains is an immediate consequence of the above remarks. 


THEOREM 3. A chain of configuration transformations in which the final 
transformation is measurable, and all the others holometric, is equivalent to a 
single, measurable configuration transformation. If the final transformation of 
the chain is endometric, complete, holometric, or isometric, then the equivalent 
single transformation has the same property. 


6. Classification of configuration transformations. Consider the division 
of any configuration c for which f(c) is defined into two segments or sub- 
configurations which shall have the following properties. The first segment 
shall either be null, or else shall constitute a configuration for which f(c) is 
defined. The second segment shall not be vacuous but otherwise shall contain 
as few elements as possible. It is clear that such a division is always possible, 
and is unique. The two segments so obtained will be called the initial segment 
and the final segment of c. If a configuration function f(c) is a function of the 
final segment only of c, the corresponding transformation T is called sym- 
metric. The transformation associated with the “set-of-games” problem is 
evidently symmetric, as are all mixings, partitions, and combinations. 

On the other hand, place selections are not, in general, symmetric. An 
example is a type of selection employed by various writers(?!) in defining 
special classes of canonical sequences, an operation which selects every ele- 
ment.in a sequence which is preceded by a specified configuration. Here two 
configurations on which selections are based may overlap, in which case the 
transformation is not symmetric. It is easily verified that the class of sym- 
metric transformations possesses the iterative property previously mentioned. 

The following are the principal theorems regarding the classification of 
configuration transformations: 

(°) Lomnicki and Ulam [1, pp. 250-251]. 

(2) Reichenbach [1, p. 291], Popper [1, p. 106], Ville [1, p. 70]. 
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THEOREM 4. Every holometric configuration transformation is isometric. 


Consider the finite product-set s=\1 XA2X + - - XA, in which each set A; 
is an element of the field V associated with the transformation 7; and let 
Then if s’=(Ai+u1) X X X(An+ua), the correspond- 
ing point set EZ, on II has unit measure, since the only points x not contained 
in this set are those for which 7(x) terminates with fewer than m elements. 
Since T is complete, the set of these points is of measure zero. Now, the 
product s’ can be expanded algebraically into a sum of 2* mutually exclusive 
product sets, $1, S2, - - - , Ss». In order to have the sum of their measures unity, 
the measure of each of the corresponding sets, E,,, E.,, + + +, Ean (Of which 
one is the set Z,) must have its maximum value according to (1). 


THEOREM 5. Every measurable symmetric transformation is isometric. 


Condition (4) can be satisfied, for the case »=1, by defining g(5) = P(E;). 
Assume that (4) holds for all (n—1)-dimensional product-sets, and consider 
the set s=A1 KA2X XKA,. Let s’ denote the set XA2X KAg-1. Now, 
the set E,-, can, by definition, be expressed in the form K,XaXarX---, 
where K,, is a subset of the m-dimensional product-space XX -- + Xz. 
Let E; denote the set of points x such that the first element of y is an element 
of 5, 5 being any subset of p. Then, Eym-E,=Km XE ,, where the latter 
expression represents the set of all points x (regarded as sequences of points 
of the space 7) which can be formed by prefixing to any sequence x belonging 
to the set E,, the configuration formed by the m elements constituting any 
point of K,,. It follows from the manner in which the function P(g) was con- 
structed that But and 
since, by definition, the sets E,-, for different values of m are mutually ex- 
clusive, P(E.) =o This completes the in- 
duction. 


THEOREM 6. Every measurable selection is endometric(?*). 


Since, in the case of a selection, the label-spaces 7 and p are identical, 
take q(y) = p(y). First, consider the case n=1, where s=),. If x belongs to 
Em, it follows from the definition of a selection that the first m—1 elements 
of x determine whether or not the mth element will be selected, and that the 
only role of the mth element, so far as membership in the set E, is concerned, 
is in determining whether the element selected belongs to the set A. There- 
fore, the set E,, can be expressed in the form Kn1X7 XX ---, where 
Ki is a subset of the (m—1)-dimensional space XX --- Xz. Then, 
where L denotes the infinite product-set ---: 
that is, the set of all sequences x having as the first element a point of Ax. 


(*) Feller [1, pp. 92-93] gives a similar proof for the case in which each component 
of s is either a particular set y or the complement x—vy. 
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Then, P(Eum) =P(Exm)p(1). Therefore, 


This proves the theorem for the case n=1. 

Now suppose the theorem to be true for all (7 —1)-dimensional product- 
sets and consider a product-set s of m dimensions. Let s’ denote the product- 
set obtained by replacing the final element X, of s by z, and let s’’ denote 
the (n—1)-dimensional product-set obtained by deleting this element. Then, 
by reasoning similar to that employed in the case = 1, P(Eum) = P(Eum)P(An)- 
Therefore, 


P(E.) = = P(Evrn) On) = P(Ev)p0x)- 


Since E, is clearly a subset of E,, P(E,)SP(E,-), whence P(E,) 
SP(E.)qQn). But, by hypothesis, (1) holds for the product-set s’’. This 
completes the induction. 

The properties of the four fundamental operations of von Mises are of 
particular interest. First considering measurability, every combination is 
measurable if there exists a measure function on 7 satisfying the require- 
ments previously laid down, for the field Y can then be chosen as identical 
with ®. The partition is measurable, provided the set y on which the selec- 
tion is based (see §2) belongs to the field ®, for in this case the set y together 
with all its subsets which are also elements of ® constitutes a field YW which 
is identical with a subfield of &. The measure function g(4) is defined by 


q(5) = p(6)/p(y). 


The mixing is measurable provided there exists on the space p a field V into 
which some subfield of © is transformed by the mixing. No general statement 
can be made regarding the measurability of place selections. 

The mixing, partition, and combination, being all symmetric transforma- 
tions, are, by Theorem 5, isometric when measurable. Every measurable 
mixing or combination is evidently complete. Finally, all measurable parti- 
tions are complete (unless p(y) =0), since it is easily shown that the set of 
points x on II containing only a finite number of elements of ¥ is of measure 
zero. 

7. Illustration. An illustration of the application of configuration trans- 
formations is to be found in the definition given by Copeland(™) for an ad- 
missible variate. Let the space « consist of all real numbers, and let p consist 
of the two labels 0 and 1. Let nr, re, - - - , 7x, m be a set of integers such that 
+--+ and let Ih, Ie, be a set of real intervals, 


(*8) Copeland [1, pp. 543-547]. 
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open on the left and closed on the right. The configuration function f(c) will 
be defined for those and only those configurations in which the number of 
elements is an integral multiple of m. Its value is determined by the last 
elements of the configuration, and is 1 if and only if the r;th of these last ” 
elements falls in the interval J; for each i (¢=1, 2, - - - , &). We shall call any 
transformation T thus defined a Copeland transformation. If F(t) is any mono- 
tonic function such that F(— ©)=0 and F()=1, a label-sequence «x is said 
to be an admissible variate with respect to the function F(#) provided the 
relative frequency of the label 1 in the corresponding sequence y = T(x) tends 
to the limit []#.,/rdF(t+0), for every possible choice of the integers 
1, * fe, m and of the intervals J;. 

Given any function F(#) satisfying the above conditions, Copeland shows 
that there exists an admissible variate with respect to F(#). We shall show 
that almost every variate x is admissible with respect to any such function 
F(t). Let the field © consist of all real intervals J open on the left and closed 
on the right, and all finite sums of such intervals; and for each such interval J, 
let p(I) = frd F(¢+0). Consider the Copeland transformation T resulting from 
a particular choice of the integers ri, r2, - « - , fx, m and the intervals J;. This 
transformation is clearly measurable, and being symmetric and complete, is 
therefore holometric. By Theorem 2, almost every sequence x is regular under 
any denumerable set of Copeland transformations. As F(t) is a monotonic 
function, it has at mrost a denumerable set of discontinuities. Let the set G 
consist of all those intervals J whose end points are either rational points or 
points of discontinuity-of F(#). Now let the set D consist of all possible Cope- 
land transformations based only on intervals belonging to G. D is then a de- 
numerable set, and almost every sequence x is regular under every transfor- 
mation of D. But such a sequence is regular under all Copeland transforma- 
tions, since, for any interval J, it is possible to select a sequence of intervals 
hh, Is, + , all belonging to G, such that 


lim dF(t + 0) f are +o. 
moo JT, I 
8. General remarks. Although the theorems of Wald(*‘) are more general 
than those of this paper in that they apply to all selections, no measure theory 
is developed for appraising sets of sequences, and transformations other than 
selections are not considered. Feller(*) avoids the assumption of absolute 
continuity and the limitation to measurable selections by constructing a 
modified (and somewhat arbitrary) measure function in terms of which all 
configuration functions are measurable. It is not known whether this prin- 
ciple can be extended to transformations other than selections. 


(*) Wald [1]. 
Feller [1]. 
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There are certain interesting non-endometric transformations, which 
nevertheless possess the regularity property of Theorem 2. No necessary 
and sufficient condition for this property is at present known. Nor is it known 
whether the general (non-symmetric) configuration transformation can be re- 
placed by a set of operations of the von Mises types. 
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THE THEORY OF ANALYTIC FUNCTIONS IN NORMED 
ABELIAN VECTOR RINGS 


BY 
EDGAR R. LORCH 


Introduction. The problem of determining the broadest domain over 
which the classic results of analytic function theory are valid has attracted 
attention continuously since the time of Hamilton’s essays in that direction 
with quaternions(!). The contributions to this subject converge from many 
directions; matrix theory, the theory of hypercomplex numbers, the theory 
of integral equations, and others, provide the frame for the investigations(?). 
It is established in the present paper that classical function theory can be 
extended to normed abelian vector rings. This result was known for finite 
dimensional rings(*). The infinite dimensional case requires for its elaboration 
certain basic informat ..1 concerning the structure of bounded linear trans- 
formations. A surprising feature of the present development is that the meth- 
ods as well as the results of the classic theory can be carried over almost 
without change(‘). 

In the preliminary section are derived the elementary properties of the 
rings here considered, Then the Cauchy theory is elaborated. The theory of 
ideals is examined insofar as it bears strongly on present questions. Use is 
made frequently of the notion of maximal ideal; the existence and principal 
properties of these were established by Gelfand(*). Illustrative of the results 
here obtained are Theorems 6 and 9. The exponential and logarithmic func- 


Presented to the Society, February 28, 1942, and April 24, 1943; received by the editors 
November 23, 1942. 

(*) In his Vorlesungen iiber die Entwicklung der Mathematik im 19. Jahrhundert (Julius 
Springer, Berlin, 1927), Felix Klein writes: “Das letzte Ziel (of quaternion theory) war—und 
ist—eine quaternionistische Funktionentheorie, von der man ganz neue, gewaltige umfassende 
Aufschltisse fiir die ganze Mathematik erhoffte. Zur Férderung dieser zwar nicht deutlichen 
aber glatibig angenommenen Ziele wurde gar 1895 ein ‘Weltbund zur Férderung der Qua- 
ternionen’ gegrtindet.” See vol. 1, p. 189. 

(?) A large though necessarily incomplete list of references is given by J. A. Ward, A theory 
of analytic functions in linear associative algebras, Duke Math. J. vol. 7 (1940) pp. 232-248. 

(®) G. Scheffers studied functions analytic over commutative and associative algebras with 
a finite basis—Verallgemeinerung der Grundlagen der gewéhnlichen komplexen Funktionen, 
Leipziger Berichte vol. 45 (1893) pp. 838-848; vol. 46 (1894) pp. 120-134. Scheffers’ results 
were extended by P. W. Ketchum, Analytic functions of hypercomplex variables, Trans. Amer. 
Math. Soc. vol. 30 (1928) pp. 641-667. 

(*) A very characteristic change in the present development is that the equality for complex 
numbers |a-5| =|a| -|5| is replaced by the inequality in rings |a-b| <|a| -||. It can readily 
be seen that certain classic proofs of function theory (convergence of integrals and series) are 
not affected by this change. 

(®) I. Gelfand, Normierte Ringe, Rec. Math. (Mat. Sbornik) N.S. vol. 9 (1941) pp. 1-24. 
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tions are discussed. The importance of considerations of reducibility here 
may be judged from the fact that exp (z) is simply periodic if and only if 
the ring is irreducible. The problem of embedding an element in a continuous 
multiplicative group is treated completely in Theorem 11. Finally, separation 
in a ring by a rectifiable curve is discussed in the last section: separation and 
irreducibility are interrelated concepts. The present paper is based on an 
analysis of spectra of linear transformations which the author has recently 
published(*). At certain points theorems are stated without proof as similar 
results appear elsewhere in the literature. In this respect the work of Gelfand 
is particularly to be cited. There is interest in noting that certain of his results 
(for example the theory of the radical) may be obtained equally from our 
work by the simple device of obtaining a representation of the given abstract 
ring by means of a ring of transformations (Theorem 1). 

Vector rings. A collection ® of elements a, b, z,--- is called a normed 
abelian vector ring if("): 

(a) ® is a normed linear complex vector space. 

(b) ® is a commutative ring with unit e. 

(c) The norm function defined over ® satisfies the inequality 


(1) |a-b| S| a| -|d}. 


Elementary properties of these rings will be derived rapidly. The inequal- 
ity |ab—ac| <|a| -|b—c| shows that multiplication is continuous in the to- 
pology defined in ® by the norm. If a~! represents the reciprocal (in trans- 


formation theory, the inverse) of a, the equation 
(a — = a! + + + 

valid for |z| <|a-'|-!, shows that the elements in 9 possessing an inverse 
form an open set. It also shows that inversion is a continuous (indeed, ana- 
lytic) procedure. If a—' exists, a is called regular; in the contrary case, a is 
called singular. The regular elements form a topological group G relative to 
multiplication. The set G, being open, is the point set sum of disjoint maximal 
open connected sets Gz, G=).«Ga(*). That component containing the unit e 
will be denoted by G; and will be called the principal component of G. If 
a€&Gi, bEG,, then ab€G, and a—'€G, that is, G; is a group. The set F of 


(°) The spectrum of linear transformations, Trans. Amer. Math. Soc. vol. 52 (1942) pp’ 
238-248. This paper will be referred to as S. In preparing S, the author unfortunately relied 
too heavily on the literature of the past decade to furnish him with a list of references. For 
that reason it was only after the present manuscript, was submitted that he noted the brief 
but pointed results of F. Riesz in initiating this theory. These are to be found in his book, 
Les systémes d’ équations linéaires, Paris, 1913, pp. 117-121. I hope Professor Riesz will accept 
my sincere apologies. 

(*) This definition appears in a paper by M. Nagumo, Einige analytische Untersuchungen 
in linearen metrischen Ringen, Jap. J. Math. vol. 13 (1936) pp. 61-80. 

(8) An example of a ring in which the multiplicative group G is not connected is the ring Rs 
of the example discussed after Theorem 12. 
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singular elements is closed. If $ is an ideal, $¥R, then §CF. If J=F, 3 is 
the radical of R (see Theorem 6). 
If a is fixed in R, 2 arbitrary, the mapping 


T.2 = az 


establishes a correspondence a—T, between § and a class of bounded linear 
transformations defined over ®; this correspondence is called the regular 
representation of R. 


THEOREM 1. The mapping a->T, of the regular representation of R is an 
algebraic and topological isomorphism. 


This theorem enables one to apply the theory of linear transformations 
to ®. Thus one may say, “the complex number A is in the spectrum of a” 
instead of the more cumbersome “A is in the spectrum of T,”(°). 

The ring ® is reducible if R is the disjoint sum of two rings each distinct 
from R, R=Ri+MR:e. More precisely, this means that if a@ER, there exists a 
unique pair a2€ M2 such that a=a,+42, a1a@2=0. Reducibility implies 
and is implied by the existence of an idempotent j, j?=j, with 70, j~e. The 
rings ®; and ®: are given by Ri=jR, R2=(e—7) RM where e—j is also an idem- 
potent. A fundamental characterization of irreducible rings is given in the 
following theorem. 


THEOREM 2. The ring ® is irreducible if and only if the spectrum of every 
element in R is a connected set. 


If R is reducible, and if 7 is an idempotent in R, je, 70, the spectrum 
of j consists of the two points \=0, A\=1, a disconnected set. Now let a be 
any element in ® whose spectrum is not connected. Let C be a simple closed 
rectifiable curve in the resolvent set of a such that points of the spectrum of a 
lie both interior and exterior to C. The integral 


(2) 
J 2ridc fe—a 
evaluated over C in the counterclockwise sense has the property j?=7(*). 
By S, Theorem 8c, d, p. 244, 70, je, hence ® is reducible. 
Clearly, if ® is finite dimensional, it is the direct sum of irreducible rings. 
This is not the case for infinite dimensional rings. Irreducible rings are funda- 
mental in function theory. For example, the ring of continuous complex 


(*) Throughout the paper, complex numbers will be denoted by Greek letters a, y,{,, °° +. 
It is worth while to compare certain expressions of abstract algebra and the corresponding ex- 
pressions in transformation theory. Thus “a is a null-divisor” is equivalent to “A=0 is in the 
point spectrum of a.” 

(**) See S, Theorem 6, p. 242. That the spectrum can be separated by a curve in case it is 
not connected is discussed under Spectral components, p. 245. 
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functions defined on a finite closed real interval, or the ring of functions 
analytic and bounded over a region of the complex plane are irreducible(**). 

It is important to know what (complex) rings # may be fields. The answer 
is provided by Theorem 3. It is also interesting to determine what rings have 
the property |a-b| =|a| -|5|. This is settled by Theorem 4. 


THEOREM 3. If R ts a field, it is the field of complex numbers(}*). 


THEOREM 4. If in R the norm function satisfies |ab| =|a|-|b| then R is 
the field of complex numbers. 


Using the fact that |e] =1, one proves that if bC©® is regular, the spec- 
trum of 6 lies on the circle || =|]. Since this is also true of b—ye where 
|u| is small, the spectrum of 5 consists of one point. Applying S, Theorems 
8d and 7, it may be shown that 3 is a scalar. 

Analytic functions. Although most proofs will be omitted in this section, 
it will be valuable to set down salient steps in sufficient detail so that the char- 
acter of the development is clearly apparent. The purpose of this is to show 
that not merely the results but also the methods and even the appearance of 
the classic theory may be transferred to ®. 

A variable element in ® will be denoted by z. A function defined over a 
region in 9 and having values in ® will be denoted by f(z). Continuity of f(z) 
is defined in evident fashion. The topology used throughout is that of the 
norm, that is, the uniform topology. A function z=2(#) where 2(¢) is a con- 
tinuous function defined on a real interval 0 S$# 31 defines a curve. The curve 
is rectifiable if the sums Zeal 2(t;) —2(t;1)| are bounded; ..ere ” is arbitrary 
and 0=4#S43S --- St,=1. It may be shown that if f(z) is continuous in a 
region H and if C is any rectifiable curve, closed or not, in H, the Riemann 
integral /¢f(z)ds exists and has the usual properties. 


DEFINITION. A function f(z) ts said to have a derivative f'(2o) at <=2 tf for 
any €>0a 5>0 can be found such that for all hin R with |h| <6, 


(3) | f(zo + h) — f(zo) — S| hl «. 


If f(z) has a derivative throughout a neighborhood of z=2, f(z) is said to be 
analytic at 2p. 


(#) Let ® be the ring of complex valued continuous functions f(#), OS#S1, with | f | 
=max|f(t)|. The intimate connection of the concept of irreducibility with some fundamental 
theorems of function theory will be illustrated: the spectrum of f(¢) consists of the set of values 
assumed by f(é). If R is assumed irreducible, this is a closed connected set. If f(#) assumes 
real values only, the set is a real closed interval. Thus f(¢) assumes its maximum (or minimum) 
on 0S#31. Also, f(¢) assumes every value between these extremes. 

If Q is the ring of functions f(s) analytic and bounded on |s| <1 with IZ| =lL.u.b. |f(s)|, 2 
is a normed abelian vector ring. Since |s| <1 is a connected set and f(s) is continuous the spec- 
trum of f(z) is connected and © is irreducible. 

() Theorems 3 and 4 have been given by S. Mazur, C.R. Acad. Sci. Paris vol. 207 (1938) 
p. 1025. 
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The Cauchy theorem may be established for functions analytic in a con- 
vex region (or in a region which is the sum of a finite number of convex 
regions). As in the classic case it is first established for triangles. If a and b 
are in the line segment joining and is given by 0S#31. 
The interior of the triangle determined by a, b, ¢ consists of the points 
Aa+pyb+vc, X+u+v=1, v>0. The proof now proceeds in stand- 
ard fashion. Incidentally, (3) is required in the form there given and not in 
the quotient form | Aw/Az—f’(z0)| S¢ normally employed to define the deriv- 
ative. From triangles one proceeds to arbitrary polygons; in a convex region 
this is trivially simple. Finally the proof is given for an arbitrary rectifiable 
curve. 

The Cauchy formula for f(z) cannot be established without specifying 
carefully the curve C around which the integration is performed. In some 
sense the point z must be interior to C. As an illustration of the phenomena 
which may arise, the integral (2) does not always have the expected value e 
or 0. However, our development needs only the fact that, for certain special 
curves C, the classic integral yields f(z). If C is a simple curve lying in the 
z-piane, that is, in the plane of points z+Ae where A is complex and arbitrary, 
and if z is interior to C in this plane, then the Cauchy formula holds. It is 
valid not only for that z but for all points of a neighborhood of z. 

Finally one establishes the Taylor expansion for f(z) in the usual manner. 
It may be seen that-in this development analytic functions could be defined 
equivalently by (1) the uniqueness of the derivative; (2) the uniqueness of 
the integral (Morera’s Theorem); and (3) the possibility of a power series 
expansion. The concept of conformality is meaningless in 9 except in the 
case that ® is the complex number field. For conformality may be defined as 
follows: R is a conformal ring if |az|=K.|z| where K. is some constant. 
Considering ® a ring of transformations (or equivalently, considering that 
|e| =1) and setting s=e yields |az| =|a| -|2| for all a, sin %. By Theorem 4, 
® is the field of complex numbers. 

A series of powers of z—a (which converges for some regular z—a) is 
called an element. An analytic function in the large is defined as the totality 
of elements which are continuations of a given one. It is curious to note that 
one formula may define more than one analytic function. For example, if G 
has more than one component, 1/z defines an analytic function in each. One 
may establish Liouville’s Theorem for functions analytic and bounded every- 
where. From this the fundamental theorem of algebra is obtained in the fol- 
lowing form: Given f(z) =cnz"+ + +¢o, »>0, c, regular, there exists at 
least one element a in ® such that f(a) is singular. The existence of a zero is 
not asserted. If c, is singular, f(a) may be regular for all a. 

It is seen that in this development partial differential equations of the 
Cauchy-Riemann type play no role. In the infinite dimensional case these 
equations would be of a most complicated type (being dependent on bases)— 
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if they could be set up at all. In the finite case, the development has been 
carried through by means of systems of partial differential equations("*). It 
would seem that this circumstance contributes to a lack of transparence in 
the given treatment. 

Ideal theory in ®. In what follows, it will be at times useful, at others 
necessary, to employ ideal-theoretic considerations in ®. The concept of 
maximal ideal is particularly important. A maximal ideal is one which while 
distinct from ® is not contained in any other ideal. Although an ideal is not 
necessarily a closed set, the word “ideal” will be used subsequently as an 
abbreviation for “closed ideal.” The basic results on ideals have been given 
by Gelfand and can be summarized as follows(“) : 


THEOREM 5. If 3 ts any ideal in R, the ring R/¥ is a normed abelian vector 
ring. An ideal can always be found which contains 3, RDY, and which is 
maximal. The ideal $ is maximal if and only if R/¥ is the field of complex 
numbers. 


It is of importance to note that many properties of ® remain invariant 
under the operation of residuation. An example will be given: if f(z) is a func- 
tion of z analytic at z=c, and if 3 is any ideal in R, then f(z) is a function of z 
in R=R/¥ analytic at 3=¢. For from the series 


f(z) = ao + — c) + — c)? + -- 
one obtains 


fe) = & + a8 — 2) 


The latter series converges for some regular since | 
s |an(z—c)*| , hence represents an analytic function. 

The following results characterize the structure of R in terms of the 
spectra of the elements of #. It has been seen in Theorem 2 how the reducibil- 
ity of R can be characterized in this fashion. An element a is nilpotent if for 
some positive integer n, a*=0. An element a is called a quasi-nilpotent if for 
all complex lima... (ua)*=0. 


THEOREM 6. An element a is a quasi-nilpotent if and only if the spectrum of a 
consists of the one point \=0. The quasi-nilpotent elements constitute an ideal R 
called the radical of R. The radical is the intersection of all maximal ideals. If 
R=R/K and ACH is the image of aER, a and a have identical spectra. Thus 
the ring R/ R ts without radical. 


If the spectrum of a consists of the point \=0, then by S, Theorem 7, 
(ua)"—0 and a is a quasi-nilpotent. Also if a is such that (ua)"—>0 the only 


(38) See footnote 3. 
(#*) Gelfand, loc. cit. pp. 6, 8, 9. 
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singularity of a is \=0. The remainder of the proof follows the ideas of Gel- 
fand. 


THEOREM 7. If & is an ideal contained in the radical R of R, R/F ts irre- 
ducible if and only if R is irreducible. 


THEOREM 8. If f(A) =ao+ar+a2d\?+ --- is a power series with scalar 
coefficients which converges absolutely for some \#0 and a is any element in R | 
whose spectrum lies entirely within the circle in the complex plane Iz = |x| , the 
series f(a) converges absolutely. The spectrum of f(a) is the transform under f(A) 
of the spectrum of a. 


Since the spectrum of \~'a lies within the unit circle, by S, Theorem 7, 
| =| 0. Thus for sufficiently large, |a*| <|A*| and f(a) 
converges absolutely. 

Now suppose \ is in the spectrum of a. Let $ be any ideal such 
that a=de($). Write p.(A)=aot+ --- +and*, n=0, 1,---. Then p,(a) 
= p,(A)e($). Letting m increase, one has f(a) =f(A)e($) or f(A) is in the:spec- 
trum of f(a). Next let wu be in the spectrum of f(a); let 3 be a maximal ideal 
such that f(a) =pe($). If a=Xe(3), repetition of the previous argument yields 
f(A) =. 

If in particular }>*.,|a,| < © and |a| =1, then once more the spectrum 
of f(a) is the transform under f(A) of that of a. An application of this result 
will be given. E 

Let f(2) be such that >> an| Then if f(z)#0 in | z| si, 

In proof, let ® be the ring of all f=f(z) for which }>*,|a,| <<. Let 
lf] =c2.,|an|. Then in ®, |z| =1 and clearly the spectrum of s consists of 
all \ with |\| $1. Thus by the theorem, the spectrum of f consists of all values 
assumed by f(z) for | s| $1. Since 0 is not one of them, 1/f(z) is in R. 


THEOREM 9. Let f(A) =aot+asrk+a2d\?+ --- (+0) be a scalar power series 
convergent for all X. Let aE be such that f(a) =0. Then there exists a polynomial 
P(A) (+0) with scalar coefficients such that p(a)=0. In particular if R is irre- 
ducible and without radical, a is a scalar, a=pe. 


Since the spectrum of a is bounded and the zeros of f(A) are finite in num- 
ber in any bounded set, the spectrum of a consists of a finite number of iso- 
lated points, Ax, - , An. Let ji, - be the idempotents 

dg 

2ridc, fe—a 
where C; is a circle in the complex plane centered on \,¢ and sufficiently small 
so as to exclude the points A,e, s*k. Then e=jfi+ - - - +7, and j,-j7,=0 for 
s#k (by S, Theorems 8d and 5). 
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Suppose first that »=1. Expand f(A) around A=A,, f(A) =B8o+6i(A—Ax) 
+--+, and f(a) - - -. If is any maximal ideal, it con- 
tains a—),e which is in the radical of R, hence it may be seen by a discussion 
modulo that an element yoe+7i(a—ue) +72(a—Are)*+ is singular 
if and only if yo=0. Since f(a) =0, =0. Suppose 6;=0, - - , 6,.1=0, 8,0. 
Then f(a) =(a—re)* [Be +Be41(a—Are) ++ - - - and (a—dse)*=0. If K is 
irreducible, then certainly n=1; if R is without radical, then one may take 
s=1. 

Now consider unrestricted. The element (a —d,e)7, belongs to the radical 
of the ring 7,9 consisting of all elements j,¢, aG®. For since jj =jx, then by S, 
Theorem 7, for large hence for all u, u*(a—dxe) = [u(a—de)jx as 
n— © ; then Theorem 6 is applied. Now it may be seen that fork=1,---,m, 
the ring 7,® is isomorphic with the ring Ri=R/(e—jx) where the symbol 
(¢—jx) represents a principal ideal. Hence since f(a) =0 in Rx, the previous 
paragraph shows that (@—),2)*=0 in Qi, that is, (e—ji:), 
k=1,---+, Thus, writing =] p(@) is in the intersection 
of the ideals (e—j,), R=1, - - - , m. Since this intersection is the zero ideal, 
p(a) =0. 

This theorem shows that an analogue to the Cayley-Hamilton theorem 
that every matrix satisfies its characteristic equation does not exist in R. In- 
deed the equation f(a) =0 implies that a is finite dimensional in the sense that 
the ring it generates is finite dimensional. 

The exponential and logarithmic functions. The function exp (z) is defined 


by the series >> *_,2"/n! which converges for all zE®. Its functional equation 
and the formula for the derivative are readily obtained. For all z, exp (2) lies 
in the principal component G, of %. First, the inverse of exp (a) is exp (—a); 
next, by the continuity of exp (z), if C is a curve joining 0 to a, then the map 
of C under the transformation z—exp (z) joins ¢ to exp (a). The relationship of 
the exponential to reducibility is interesting. 


THEOREM 10. The function exp (2) ts simply periodic if and only if R is ir- 
reducible. More precisely, the periods of exp (2) are exactly the elements 
2i).2_,tjx where the j, are idempotents, distinct or not. 


Direct substitution in the series for exp (z) shows that exp (27ij) =e in 
case j is any idempotent. Thus if exp (z) is simply periodic, then ® has 
but one idempotent (aside from 0), namely e; hence & is irreducible. Now 
suppose that for a given a in R, exp (27ia) —e=0. By Theorem 8, the spec- 
trum of a consists of one or more of the points \=0, +1, - - - . If a has more 
than one point in its spectrum, then ® is reducible (Theorem 2). Suppose 
now that a has only one point in its spectrum; by translation one may as- 
sume that A=0 is that point. Let b=exp (mia); then b?=e and (b+ is 
an idempotent. If be, b+ —e, R is reducible. In the contrary case, since the 
spectrum of a consists of \=0, by Theorem 8 the spectrum of exp (xia) +e 


| 
| 
| 
| 
| 
| 
| 
| 
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consists of \=2, hence + —e but rather }=e. Consider now in turn the ele- 
ments exp (ria /Z), exp (xia /4), and so on; either is found to be reducible or 
the analytic function exp (27riaz) has the value e for z=e/2", n=0, 1, 

The function is identically e and its derivative 27ia is 0. This shows that 3 
is irreducible if and only if exp (z) is simply periodic. 

In proof of the last statement of the theorem, let Au, - - - , Ax be the points 
of the spectrum of a, A, having an integral value. Let j;, - - - , 7, be the idem- 
potents defined for a as in Theorem 9. Then a=aji+ --- +aj,. Also 
exp (2riaj,) =e and is in the radical of k=1, - - m. Ap- 
plying the preceding paragraph. aj, giving @=Aajit - - +Anja. Except 
for notation, this is the statement in the theorem. 

The logarithm is defined by 

* dz 
log z = 
e @ 
where the path of integration joining e to z is arbitrary but must consist 
of regular elements. Thus the logarithm is defined for elements in the prin- 
cipal component only. It may be shown that for any a in ®, the equation 
log z=a has solutions. By what precedes and by the obvious properties of 
the logarithm, log e=2ri)_3_,+j. (Theorem 10). 

The components of G and continuous groups. The components G, of the 
group G of regular elements in ® have been defined earlier; the principal 
component G; is that containing e. The group G; consists precisely of the ele- 
ments a expressible in the form a=exp (0), bE€R. For if b€R, then 
exp (b)€G;,; if then leg a exists and a=exp (log a)("*). For every 
there is a 6’ such that b’€G, and exp (0) =exp (b’). Indeed if m is an integer 
such that 2xn>|b|, then b’=2xine+ satisfies these conditions. Thus the 
map of G, under the exponential function is G; in its entirety. 

A set of elements a*, where s is an exponent ranging over the real or com- 
plex numbers or over a more general range, constitutes a continuous additive 
group if a*-a'=a*t and if the group operation is continuous. In many in- 
vestigations it is of importance to determine whether a given element in ® 
may be embedded in such a group. 

THEOREM 11. An element aER may be embedded in a continuous additive 
group, a*-a'=a‘*', if and only if a is in the principal component G, of R. In 
that case the whole ring R serves as a range of exponents. The groups are all of 
the form a* =exp (s log a) where log a is any determination of the logarithm. 


If the elements a* form a continuous group, then letting s have real val- 
ues, 0SsS1, one may see that a is in G;. It is known that ali such continu- 


(*) This result is given by Nagumo, loc. cit. p. 66, Theorem 3. 
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ous groups with real exponents are of the form a*=exp (s log a)("*). If now s is 
allowed to range over R an additive group is obtained which is an extension 
of the given group. 

This concept of a general exponent seems to be new. Thus if a is a finite 
dimensional nonsingular matrix one may consider such expressions as a*. 


THEOREM 12. If the number of components of the topological group G is 
greater than one, it is infinite. 


Suppose that ¢ is a regular element in ® not in the principal compdénent G. 
Then the elements ¢*, n=0, +1, ---, belong to distinct components. For 
suppose that and 22, belong to the same component; then =i" 
is in G;. Let ¢* be embedded in a continuous group (Theorem 11), and if 
m0, let (¢")/™=s. Then if u=ts-!, u™=e. The spectrum of u consists at 
most of the mth roots of unity (Theorem 8). Thus ~€G;, hence t=suCG. 
This implies that m=0 and that /*, n»=0, +1,--- are in distinct compo- 
nents. 

Examples will be given to show how the regularity of an element and 
also the component in which it belongs are influenced by ring extensions and 
contractions. If a is regular in and the resolvent set of a is connected, then 
a€Gi; for a and a+ne with n>|a| lie in the same connected component of G 
and a+meCG,. The converse proposition is false. Let 9 be the ring of all 
complex-valued continuous functions f(#), 0S¢<1, with |f|=max|f(é)|. 
Let 92 be the subring of 9; generated by exp (27it) and exp (—2zi/). Let 
R; be the subring of 9, generated by exp (27i#) and 1. Then in Mi, exp (27?) 
is regular and since it is of the form exp (b) where © Mi, it is in the prin- 
cipal component. In %2, exp (27i#) is regular but not in the principal com- 
ponent; for if f(#)G Re, f(0)=f(1), hence Re. In Rs, exp (27it) is a 
singular element. For if one writes exp exp (2rist) =k(#) 
where and the a, are arbitrary, then |&| =max | exp (27it)| =| 
-exp (2mit)dt| =1. That all points \ with |A| <1 are in the spectrum of 
exp (27it) considered as an element of 3 is a consequence of the next theo- 
rem. 

The preceding discussion shows that spectral singularities may at times 
be removed by ring extensions. If R2CR: and aE R:2 and if a—Xe is singular 
in ®2 but regular in Mi, A is called a removable spectral singularity of a. In 
case \ is not removable, the singularity is called a permanent spectral singu- 
larity. 


THEOREM 13. All points belonging to the frontier of the spectrum of an ele- 
ment ain R are permanent spectral singularities of a. 


(8) The fact that any real continuous group a*, — © <s< ©, is of the form a*=exp (sb) is 
proved by Nagumo, loc. cit. p. 72, Theorem 9, See also D. S. Nathan, One-parameter groups of 
transformations in abstract vector spaces, Duke Math. J. vol. 1 (1935) pp. 518-526. 
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Since the scalar series ({—A)~'=)__,A*-{-* converges absolutely for 
|g-2| then by Theorem 8, converges absolutely (to 
(a—de) for any element a whose spectrum lies outside the circle 

Let and let If a—ype is regular in hence also in Mi, 
the convergence or divergence of the series for ([a—e]—de)—! is settled with- 
out reference to the ring to which a belongs. If v is a frontier point of the 
spectrum of 2 considered as an element of 2, this series cannot converge for 
any \ with |A| >|»—y]. This implies that a—ve is singular in ®; hence v is a 
permanent spectral singularity(*”). 

An application of Theorem 13 will be given. 

Let the spectrum of aER consist of real points only. Then the spectrum is 
invariant under ring extensions and contractions. 

Separation in ® by rectifiable curves. The theory of separation in R by 
means of rectifiable curves resembles strongly that in the complex plane. If 
C:z=2(t), OStS1, 2(0)=2(1), is a closed continuous rectifiable curve (for 
definition see above), C is called simple if t;4t, (mod 1) implies [2(t,:) —2(é) ]- 
exists. If C is such a curve in ® and $ is any ideal then C:2=2(¢) is such a 
curve in R =R/. For C is clearly simple and since the norm of 2(#) in R does 
not exceed the norm of 2(é) in ®, C is rectifiable. In particular, if $ is maximal, 
R is the field of complex numbers and C is a simple closed rectifiable curve in 
the usual sense. 

If C:z=2(t) is as above and aE & is such that 2(¢) —a is singular for some ?, 
a is called C-singular. If 2(¢) —a is regular for all ¢, a is called C-regular. There 


exist C-regular elements. For example a=2(0)+)e, where || is sufficiently 
large, is C-regular. Since C is a compact set the set of C-singular elements is 
closed, that of C-regular elements is open. Consider the integral 


1 dz 


4 
(4) 2—a 


where a is C-regular. Let b be on Cand let 2’ = (s—a)/(b—a). This substitution 
leaves the integral (4) unchanged in value and replaces C by a simple curve 
C’ lying in G; and containing e. Hence u=(2ri)-! log e=)_7.,+j% where the 
je are idempotents. If ® is irreducible, u=ne where n is an integer (Theo- 
rem 10). 

Let $ be a maximal ideal and let 2(¢#)=f{(#)(3). Then under the homo- 
morphism R—-R=R/F, the integral (4) is transformed into a classical in- 
teyral @ in the complex plane. Choosing a proper sense along C:{={(2), 
hence along C, #=0 or #=1. Since 2=né, n=0 or 1. The C-regular elements 


(*") This result is implied by a lemma of G. Silov, On the extension of maximal ideals, C.R. 
(Doklady) Acad. Sci. URSS. N.S. vol. 29 (1940) pp. 83-84. Silov proves that a—ve is a general- 
ized divisor of zero, hence » is a permanent spectral singularity. An element } is called a gen- 
eralized divisor of zero if there exist elements c,, m=1, 2, - + - , such that | cn| =1 and | ben| 0 
as 
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for which u=e will be called C-interior elements. Those for which u=0 will 
be called C-exterior. The C-interior elements form an open set which need not 
be connected. The same is true of the C-exterior elements. Exterior elements 
exist: if b is on C, b+Ne is exterior to C for large ||. The existence of interior 
elements will be discussed below. In résumé, one has the following theorem. 


THEOREM 14. Jf R is an irreducible ring, a simple closed rectifiable curve 
C:2=2(t) partitions by means of the integral (4) the elements a of R into three 
classes: 

(1) The closed set of C-singular elements. 

(2) The open set of C-exterior elements a for which u=0. 

(3) The open set of C-inierior elements a for which u=e. 


The existence of interior elements is related to differentiability on C in 
the following theorem. 


THEOREM 15. If C:z=2(t) is a simple closed rectifiable curve in R and if for 
some t the derivative 2'(t) exists and is a regular element, then C has interior 
points. 

Suppose 2’(#) exists and is regular at t=%; for t—t+0, write (z(t) 
—2(to))/(t—to) =a+h where a=2'(t). Suppose (a—b)-! exists for DER, 
|b| <r where r is some constant. Choose ¢, 0<¢<r, and find v>0so that when 
0<|t—t| <v, <e€. Choose a 5>0 such that for | z—2(to)| <8, |t—to| 
(z—s(t))—! exists. Let r be a real number not equal to 0, | ra| <6. Then 
w(t) =2(t) —2(to.) —ira is regular for all ¢. This is clearly true for | —to| <v. 
For | t—to| write w(#) in the form w(t) = (t—to—ir) [a+ 
and it is seen here also that w(#) is regular. 

If $ is a maximal ideal in R, and 2(¢)={(é)(3), it may be seen that 
(to) =a exists and The point {(é)+é7a@ lies within the curve {={(é) 
either for a r>0 or a r<0. Thus the integral (4) in R=R/ has for this 
choice of r the value #=1. Hence u =e and 2(to) +i7<a is interior to C. 
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1. Introduction. In this paper we shall deal with the following general prob- 
lem: Let f(x', x7, -- +, x", 0, +--+, 0*) be the joint probability density func- 
tion of the variates (chance variables) x', - - - , x" involving unknown pa- 
rameters 6', - - - , 0*. Any set of k values 6', - - - , 0* can be represented by a 
point @ in the k-dimensional Cartesian space with the coordinates 6, - - - , 6*. 
We shall denote the set of all possible parameter points by Q. The set Q is 
called parameter space. The parameter space 2 may be the whole k-dimen- 
sional Cartesian space, or a subset of it. For any subset w of Q, we shall 
denote by H,, the hypothesis that the parameter point lies in w. If w consists 
of a single point, H, is called a simple hypothesis, otherwise H, is called a 
composite hypothesis. In this paper we shall discuss the question of an ap- 
propriate test of the hypothesis H, based on a large number of independent 
observations on x!, - x". 

For simplicity we shall introduce the following notations: The letter 0 or 6; 
for any subscript ¢ will denote a point in the parameter space Q. The letter x 


Some of the results contained in this paper were presented to the Society, February 22, 
1941 and September 2, 1941; received by the editors March 31, 1943. 
(*) Research under a grant-in-aid from the Carnegie Corporation of New York. 
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will denote the random vector with the components x',---, x’, and xq 
will denote the random vector with the components x}, ---, x, where xf 
(¢=1, 2,---+, 7) denotes the ath observation on x‘. In general, the com- 
ponents of a vector v in the s-dimensional space will be denoted by 2’, - - -, v’, 
that is the components of a vector will always be indicated by superscripts. 
Throughout this paper all vectors have their initial points at the origin. 
We denote by £, a sample point in the rn-dimensional sample space of 
independent observations on the random vector x. For any relation R we 
denote by P(R| 6) the probability that R holds under the assumption that 0 
is the true parameter point. A region (subset) of the rm-dimensional sample 
space will always be denoted by a capital letter with the subscript . For any 
region W, the symbol P(W,,| 6) will denote the probability that £, falls 
within W,, under the assumption that 6 is the true parameter point. Through- 
out this paper the word “region” will be used synonymously with “subset,” 
since in the theory of testing statistical hypotheses it is customary to call the 
subsets which are used as criterions of rejection, critical regions. 

By maximum likelihood estimates 61,---, & of @,---, 0* we mean 
values of +--+, for which [[%_,f(xi, ---, x%, 0*) becomes a 
maximum. The subscript m in the symbol 6 will indicate that the maximum 
likelihood estimate is based on m independent observations on x’, +--+, x’. 
A region W, in the rn-dimensional sample space is called a critical region 
for testing the hypothesis H, if we decide to reject H, when and only when 
the observed sample point falls within W,. For any @ not in w the value of 
P(W,,| 6) is called the power of the critical region W, with respect to the 
alternative hypothesis 6. The least upper bound of P(W,| 0) with respect 
to 6, restricting @ to w, will be called the size of the critical region W,. A criti- 
cal region is considered the better, the smaller its size and the greater its 
power. 

In several previous publications(*) the author has considered the case of 
a single unknown parameter @ and the problem of testing a simple hypothesis 
6=6». It was shown, among other things, that under certain conditions the 
critical region given by the inequality | / *(6,— 60) | 2A, has certain optimum 
properties. Here the symbol A, denotes some properly chosen constant. In 
this paper the general case of several unknown parameters is treated and 
simple as well as composite hypotheses are considered. 

By an equality or ittequality among vectors we sha!l mean that the equality 
or inequality holds for all components. For example, if 6, denotes the vector 
with the components 63, - - -, 6, where 6{ is the maximum likelihood esti- 
mate of 6‘, and if A is a real number, then the inequality 


— 0) <A 


(*) Asymptotécally most powerful tests of statistical hypotheses, Ann. Math. Statist. vol. 12 
(1941). Some examples of asymptotically most powerful tests, Ann. Math. Statist. ibid. 
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denotes the set of inequalities 
<A 

Or, if ¢ is a vector with the components ?#, - - - , #*, then 
n'l2(6, — 0) <¢ 


denotes the set of inequalities 
<# (j= 1,---, 2). 
2. Assumptions on the density function f(x,0). For any function ¥(x) weshall 
use the symbol {*$y(x)dx as an abbreviation for ft Sy(x) dx!- - - dx’. 
Denote by Eaf(x) the expected value of ¥(x) under the assumption that @ is 
the true parameter point, that is 


+00 
= v(x) f(x, O)dx. 


For any x, any positive value 5, and any 6, denote by yi;(x, 4, 5) 
the greatest lower bound, and by ¢i;(x, 0, 5) the least upper bound of 
0? log f(x, @)/00‘00/ with respect to @ in the 6-interval |@—6,| sé. 

Throughout this paper the following assumptions on f(x, @) are made: 

AssumPTION I. Denote by D, the set of all sample points E, for which the 
maxinium likelihood estimate 6, =(61, - - - , 6) exists and the second order par- 
tial derivatives 0°f (xa, are continuous 
functions of 0. It is assumed that 


lim P(D,|6) = 1 uniformly in 0. 
n= 
If for a sample point E, there exist several maximum likelihood estimates, 


we can select one of them by some given rule. Hence we shall consider 6, as 
a single-valued function of Z, defined for all points of D,. 


AssuMPTION II. For any positive ¢ 
lim P[| 6, —0| <«|@] =1 


uniformly in 0, where 6, denotes the vector with the components 61,---, & 
and 6, is the maximum likelihood estimate of 6°. 


Assumption II is somewhat more than consistency of the maximum likeli- 
hood estimate 6,. In fact consistency means only that for any positive e 


lim P[| 6, — < «| 0] =1, 


without requiring that the convergence be uniform in 0. If 6, satisfies Assump- 
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tion II we shall say that 6, is a uniformly consistent estimate of 0. A rigorous 
proof of the consistency of 6, (under certain restrictions on f(x, 6)) was given 
by J. L. Doob(*). The uniform consistency of 6, together with the uniform 
consistency of the likelihood ratio test will be proved on the basis of some 
weak assumptions on f(x, @) in a forthcoming paper. 


AssuMPTION III. The following three conditions are fulfilled: 

(a) For any sequences {On}, {O2n}, and for which On 
= 6 and lim 5, =0 we have 

log f(x, 6) 

uniformly in 6. 

(b) There exists a positive € such that the expectations Ey,[Wi;(x, 2, 6) ]* and 
Ex, [bi3(x, 62, 5) ]? are bounded functions of 01, 02 and 5 in the domain D, defined 
by the inequalities |0,—02| Sand | 5| Se. 

(c) The greatest lower bound with respect to 0 of the absolute value of the de- 
terminant of the matrix || — log f(x, is positive. 


Assumption IV. 0)/00' dx = 0)/00'00' dx =0. 


Assumption IV simply means that we may differentiate with respect to 0 
under the integral sign. In fact 


+00 
f f(x, dx =1 


identically in 0. Hence 
2 


a +00 
— x, Idx = 
fx ) 


Differentiating under the integral sign, we obtain the relations in Assump- 
tion IV. 


+00 
f(x, dx = 0. 


AssuMPTION V. There exists a positive n such that 


a 


are bounded functions of 6. 


3. The joint limit distribution of 6,. Denote n/2(6—6*) by 24(6*) and let 
2,(0) be the vector with the components 23(6"), - - - , 24(0*). For any constant 
vector ¢ denote the probability by ®,(¢| 6). We shall prove the 
following proposition. 

(*) J. L. Doob, Probabslity and statésttcs, Trans. Amer. Math. Soc. vol. 36 (1934) pp. 759- 
775. 
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ProposiTIOn I. The distribution function &,(¢| 6) converges with n— ©, uni- 
formly in t and 0, towards the cumulative *tultivariate normal distribution with 
zero means and covariance matrix 


= 


¢:;(0) = — log f(x, 
Proof. Because of Assumption IV, we have 


06+ 


(1) = 0. 


0? log f(x, 1 0? f(x, 0) 1 


we obtain because of Assumption IV 


From (2) it follows that the matrix ||c;;(9)|| is positive definite or semidefinite. 
Because of condition (c) of Assumption III the matrix ||c;;(8)|| must be posi- 
tive definite. For any point EZ, of the set D, defined in Assumption I consider 
the Taylor expansion 


log f(%a, 41) log f(%a, 4) 


where @ lies on the segment connecting the points @ and 6;. Denote 
log f(xa, by and let be the vector with the compo- 
nents y.(@),---, yk(0). Substituting 6, for @, in (3), the left-hand side of 
(3) becomes equal to zero and we obtain 


‘ 1/243 gad log f(xa, 
(4) nO +E —[ 
i n 


(3) > log 


00 ‘00? 


« 


or 


1 0? | 
(5) + p 2, (0) > = 0. 


« 


where 
From 
-° 
= 
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Let v be an arbitrary positive number and let Q,(@) be the subset of D, for 
which the inequality 
1 log f(xa, 9) 


+ 0 <p 
8090: 


(6) 


holds. We will prove that 
(7) lim P[Q,(6) | 0] = 1 


uniformly in @. 
Let 79 be a positive number such that 


d? log f(x, 6) | v 

2 

log f(x, 4) | 
| 2 


| x, 6, To) Es 
(8) 

| Ewi;(x, 6, To) Es 
for all values of @. Because of condition (a) of Assumption III, such a 7» cer- 


tainly exists. Denote by R,(@) the subset of D, consisting of all points Z, for 
which the inequality 


(9) | 6, S 70 
holds. Because of Assumption II 
(10) lim P[R,(6) | 6] = 1 


uniformly in 0. Since 4 lies in the interval [6,, @] we have for all points of R,(6) 
(11) | — 6*| < (i= 1,-++, 
Hence at any point of R,(@) the inequality 


2? log f(xa, 8) 


(12) 9 70) SO 
holds. Let the region S,(0) be defined by the inequality 

1 
(13) j(Xa; 6, To) = (x, 6, < 


and 7,(0@) be defined by the inequality 


1 
(14) Vii(Xay 6, 70) Ewi;(x, 6, To) < 


It follows from (b) of Assumption III and Tshebysheff’s inequality that: 


| 
| 
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(15) lim P[S,(6)| 6] = lim P[T7,(6)| 0] = 1 


uniformly in 6. Denote by U,(@) the common part of the regions R,(@), S,(@) 
and 7,(@). In U,(@) we have because of (8), (13) and (14) 


3? log f(x, 6) | 

00 

log f(x, 6) | 

| 
00307 


j(Xa, 9, to.) — Ee 


(16) 
| > Vii(Xay 6, 70) Es 


From this we obtain (6) because of (12). That is to say, the inequality (6) 
is valid everywhere in U,(0). Since limaae P[U,(6)|@]=1 uniformly in @ and 
U,(@) is a subset of D,, our statement about Q,(@) is proved. 

Since the determinant |¢;;(@)| has a positive lower bound, we obtain 
easily from (5) and (6) 


(17) =  yal0) + vesin(En, 6, »)], 


where €ij.(EZ,, 9, v) is a bounded function of E,, 0, and v, provided that, for 
each 6, E, is restricted to points of Q,(6) and |»| is less than a certain posi- 
tive number 7. 

Let 2,(6) be definéd as follows: 2,(@) =z,(@) at any point of Q,(0), and 


= 


at any point outside Q,(@). It follows from (7) that 
(18) lim {P[z,(0) < ¢| — < =0 


uniformly in ¢ and @. 

Denote >> y,(0)o:;(0) by #(@) and let &(@) be the vector with the com- 
ponents £4(0),---, #(0). From (1), (2), Assumption V and the general 
limit theorems it follows that P[y,(@) <¢|@] converges with n— ©, uniformly 
in ¢ and @, towards the k-variate cumulative normal distribution with zero 
means and covariance matrix ||c;,(0)||. From this we easily obtain that 
P[é,(8) <t|@] converges with n—> ©, uniformly in ¢ and 0, towards the cumu- 
lative joint normal distribution with zero means and covariance matrix 
||o.3(0)||. Since v can be chosen arbitrarily small, we obviously have because 
of (17) 


(19) lim { P[z,(0) < ¢| 6] — < =0 


uniformly in ¢ and @. Proposition I follows from (18) and (19). 
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4. Reduction of the general problem to the case of a multivariate normal 
distribution. In this section we shall prove two lemmas which will enable us 
to reduce the general problem of large sample inference to the case where the 
variates under consideration have a joint normal distribution. 


Lemma 1. For each positive integer n there exists a set-function W,*(W,) 
defined over all Borel measurable subsets W,, of the rn-dimensional sample space 
such that the following two conditions are fulfilled: 

(a) For each W., W,.*(W,) ts a Borel measurable subset of the rn-dimen- 
sional sample space with the following property: For each point of W,*(W,) the 
maximum likelihood estimate exists and if a sample point E, lies in W,*(W,) 
then also all those points lie in W.*(W,) for which ) =6,(En). 

(b) Lim... {P[W.*(W,)|0]—P[W,|0]} =0 uniformly in 0 and W,. 


Proof. Let \ be a real variable which takes only non-negative values and 
consider the region W,(0, A) defined as the common part of the region W, 
and the region | 21/2(6, —6) | Sx. Similarly let W.*(W,, 0, A) be the intersec- 
tion of W,*(W,) and the region | m/*(6,—6)| SX. 

For any function we will denote by g.l.b., ®(v) and |.u.b., the 
greatest lower bound and the least upper bound of ®(v) respectively. 

Since, on account of Proposition I, for any sequence {d,} for which 
An = © 


lim [P[| 2/26, — 6)| < = 1, 


Lemma | is proved if we show that there exists a sequence {A,} not depending 
on @ and W, such that lim,... A, = © and 


(20) lim { P[W,(0, | 0] — P[W2(W., 8, @]} = 0 


uniformly in W,, and 6. 

Let g be a real variable restricted to values greater than 1. For any set 
of & integers - rx) and for any value g denote by J,(r, ---, 
the region defined by the inequalities: 


— 1/2 r+ 1/2 — 1/2 r 1/2 


Furthermore denote by @,(f1, - - -, rx, g) the parameter point with the co- 
ordinates 17;/qn'/*,---, r,/qgn*. We order the system of all sets of & in- 
tegers (ri, -- +, f%) in a sequence and we shall denote by J,,(g) the interval 
In(t1, Tk» Q) where is the sth element in the ordered se- 
quence (s=1, 2,---, ad inf.). Similarly 0,,(q) denotes the parameter point 
On(r1, Where (71, , is the sth element in the ordered sequence. 


—— 
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Let Jns(W,, g) be the common part of the three regions J,,.(g), W, and 
Q.[@ne(g) ] where for any 0, Q,(0) is defined by the inequalities 
1 1 1 
—> 6, =) + | s + v(m) + 
n n 


nN 


(22) 


1 1 1 
—> 6, + ¢;;(8) | v(m) + (mn). 
n n 


The expression v(m) is equal to l.u.b.» 0, — 0, ] 
and the expression »(”) is defined as follows: Denote by c;;(@, m) the least 
upper bound of | ¢;(@) —c:;(8)| with respect to 9 where 9 can take only values 
in the interval [@—1/n"*, 6+1/n"*]. Then ¥(m) is defined as the least upper 
bound of ¢;;(@, m) with respect to 6, 1 and j7. Because of condition (a) of As- 
sumption III, we obtain 

(23) lim = lim = 0. 

Let J,.(W., g) be a subset of J,.(g) such that the following two conditions 
are fulfilled: 

(24a) If EZ, is an element of J(W., g) then also all those points EZ, for 
which 6,(E,) =6,(E,) are elements of J,5(W,, g), that is g) can be 
represented as a subset of the space of the maximum likelihood estimates. 
Furthermore J,4(W,, @) is an interval in this space. 

=0 where $( Va 6) denotes the probability of V, calculated on the basis that 
the joint distribution of /2(6,—6'), ---, m2(65—6*) is normal with zero 
means and covariance matrix ||o;;(6)|| 

The existence of such a set Jx(W,, g) can be proved as follows: Obvi- 
ously there exists a subset J,4(W,, g) of Ins(g) such that (24a) is fulfilled and 


| = min { 9) | One(q)], BL | ]} 


Since Jne(Wa, g) is a subset of I,.(g) and since ] 
— [Tne(q) | ]} =O uniformly in s, the above defined subset J%,(W,, 
satisfies also the condition (24b). We define 


e=1 
Furthermore we define the regions J,(W,, 8, A, g) and J,*(W,, 0, », g) as 
follows: 


(26) 9,2, = Ds 


8 
& 
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where the summation is to be taken over all values of s for which |@—0n4(q) | 

Let be the intersection of J(Wa, g) and Qn[@ns(q)], and let 
J*(Wa, 8, where the summation is to be taken over all 
values of s for which Furthermore let W,*(W,, 0, A, g) be 
the intersection of q) with the region | n/*(6,—0)| <2. 

If for a value of s we have n'2| 0—One(q) | SA then for all points of Ins(q) 
we have n/?|9—6,| SA+1/g. If there exists a point in I,,.(g) for which 
n2|@—6,| S\—1/q then n/?|@—6,.(g)| SA. Hence W.*(W,, 0, A—1/g, is 
a subset of J,*(W,, 0, A, g), and the latter is a subset of W,*(W,, 8, 4+1/g, q). 
Thus 


(28) < 8, 1/q, q)| 6] — — 1/g, 6] 
< — 1/q 6, —0| SX + 1/q| 6]. 


According to condition (b) of Assumption III E[¢:;(x, 0, 5)]? and 
0, are bounded functions of 6. Hence also Ee¢i;(x, 0, 5) and 
Ea):;(x, 0, 5) are bounded functions of 6. Substituting 6 =0 we find that c;;(@) 
is a bounded function of 0. From the boundedness of c;;(@) and from the 
fact that the determinant | ci3(8) | has a positive lower bound (condition (c) 
of Assumption III) it follows because of Proposition I that 


(29) lim {l.u.b. P[A — 1/q¢ S n¥/?| 6, —0| SX + 1/q|e]} =0 
q=x 
uniformly in A. From-(28) and (29) we obtain 
(30) lim sup{1.u.b.| P[W2(Wn, 0, d, g) | 8] — », g)| |} = ald, 
n= co 6, 


where lim,.. «(A, g) =0 uniformly in A. 

Denote by R,(6, A, g) the common part of the regions Q,[6,.(q)] formed 
for all values s for which |@—0,4(q) | <=d/n"/?, Then for almost all 2 the region 
R,(@, X, g) contains the region T,,(@) as a subset, where T,,(8) is defined by the 
inequalities 

1 2 1 
(31) 


1 2 1 
| 6, + | ms + v(n). 


In fact, from (31) it follows that 
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1/3 nls 


1 
ns v(n) >> s 6:;(8) + —-+ v(n), 

(32) j 


Since 


2 


for almost all ” and for all s for which nv2| 0—8n2(q)| SX, we obtain from (32) 


1/3 nis 


1 

(33) 

— — — S — One (Q), S :;(0) v(m), 
for almost all m and for all s for which | <i. Since | 
—ci;(0)| for almost all m and for all s for which | SAX, we 
obtain from (33) 


1 1 1 


1 
S + + v(n) + ¥(n), 
(34) 


nls 


S [0n0(g)] + — + + ¥(n) 


nls 


for almost all m and for all s for which n'/?|@—6,.(g)| SA. The inequalities 
(34) are equivalent to tise inequalities (22) if in (22) 0,..(q) is substituted for @. 
Hence our statement about the region R,(0, A, g) is proved. 


436 
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Consider the region U,(0) defined by the inequalities 


1 2 2 1 


1 2 2 1 


(35) 


v(m) = | (« 6, + | 


v(m) | + 6, —) 


the validity of (35) implies the validity of (31). Hence U,() is a subset of 
T,(@). From condition (b) of Assumption III and Tshebysheff's inequality it 
follows that lim... P[U.(8)|@]=1 uniformly in 6. Hence lim,. P[7.(0)| 6] =1 
uniformly in @. Thus, as can easily be seen, 


(36) lim P[R,(0, a, g) | 6] = 1 


for any bounded sequence { An} uniformly in @ and gq. 

Let Jns(Wa, g) be the intersection of the regions W,, and J,.(g). Further- 
more let 9, g) be equal to >>.Jns(Wn, where the summation is 
to be taken over all values of s for which n” 2 0—Ons(q)| SX. Then the common 
part of ,J,,.(q) and W,(0, \—1/g) is a subset of J,(Wa, 9, g), and the 
last is a subset of the common part of >>7,J,,(g) and W,(0, A+1/g). Hence, 
since we have 


(37) < P[W.(0, 1/g)| — P[W.(6, — 1/9) | 8] 
< — 1/q n'*| 6, — 1/q| 6). 

From (37) and (29) it follows that for any sequence {qn} for which lim g,= ©, 
we have 
(38) lim | P[W.(6, d)| 0] — 0, , gn) | 0] | = 0 
uniformly in 6, W,, and X. 

Since the common part of the regions J,(W,, 6, g) and R,(0, A, g) is 


contained as a subset in J,(Wa, 8, A, g) and since the last is a subset of 
J.(Wa, 9, , @),-we obtain from (38) and (36) that for any bounded sequence 


437 
Since | 
and 
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{X,} and for any sequence {g,} for which lim,-. gn= © we have 


(39) lim {P[W,(0, Ax) | 0] — P[Jn(Wa, 0, An» | = 0 
uniformly in 6 and W,. 

Since the common part of the regions J,*(W,, 0, A, g) and R,(8, A, g) isa 
subset of J,*(W,, 0, \, g) and the last is a subset of J,*(Wn, 0, \, g) we obtain 
from (36) that for any bounded sequence {A,} and for any sequence {gn} 
for which lim,.. gn = ©, we have 


(40) { P[Tn(Wa, Xn qn) | ©] — P[Ta(Was 0, Gn) | = 0 


uniformly in @ and W,,. 

Now we shall evaluate the limit values of P[J,(Wn, 9, , q)|@] and 
P[7.*(W., 0, , g)|@]. Denote by A,(A, g) the domain in the space of the 
variables 0, W,, 0’ and E, defined as follows: 9 and W, can take arbitrary 
values, 0’ is restricted to values for which |e’—6| S)/n"?; and for any 6 and 
W,, E, is restricted to points which lie in the sum of the sets J,(Wa, 8, X, g) 
and J,*(W,, 8, , g). Denote furthermore by pn(6’, 0, 6,) the function 


(41) pn(0', 8, bx) = — m(6" — — bn)cas(6). 
- j 
Consider the ‘l'aylor expansion 


log f(a 6’) 
(42) 8? log fla, 8) 


n i 1 
log f(%a, On) + (0 —6,)(0 — 6,) 


where @ lies in the interval [8’, 6,]. 

Since in the domain A,(A, g) any point E, lies in the sum of the sets 
Jn(Wa, 9, g) and J,*(W,, 9, g), it follows from the definitions of these sets 
that E, lies in the set >-,Q,[0..(¢)] where the summation is to be taken over 
all values of s for which n/?|@—0,,(q)| SA (the set Q,(@) is defined in (22)). 
Hence for any £, in the domain A,(A, g) we have 


1 1 1 
nN a n 


nils 


(43) 


1 1 
nN « n 


for that value of s for which £, lies in J,.(g). In all that follows, with any 


n= 


1943] TESTS OF STATISTICAL HYPOTHESES 439 


point of the domain A, (A, g) we shall associate the integer s for which £, lies 
in Ins(q). Since m/?|@—0,,(g)| SA in the domain A,(A, g), we have in the 
domain A,(A, q) 


(44) 


for almost all values of . From the definition of 9(m) and from the validity 
of the inequality n2|@—6,.(g)| SA in A,(A, g) we find that in the domain 


A,(A, 
(45) | ] — S 


for almost all values of ~. From (43), (44) and (45) it follows that in the 
domain A,(A, q) 


1 
dij a) + te v(n) + 29(n), 


(46) 
+ cal | < + v(n) + 25(n) 


nils 


for almost all values of m. * 
Since @—6,.(q)| SA in A,(A, g), we have 


(47) SX+1/q in A,(A, Q) 
and therefore 

(48) n2|@’ —6,| in Q). 
Since @ lies in the interval [6’, 6,], from (48) we obtain 
(49) n'!?|6—6,| in A,(A, 9). 
From (47) and (49) it follows that in A,(A, g) 


00°00? 
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for almost all m. Because of (23), (46) and (50) we obtain from (41) and (42) 


(51) lim Dd log f(xe, — >> log f(xa, On) — pal6’, = 0, 


and 


lim [x log 9) — log f(xe, 


gee A,(Q,@) a 


(52) 
— [pn(0, 0, On) — pn(0’, 8, n) = 0. 


Denote P[Jns(Wa; g) | by g) and P[T5(W,, q)| ] by 
P* (Wa, Substituting for 6’ we easily obtain from (52) 


lim {iub. P[J.(Wn, 9, 9) | 


(53) 
Pal Wa, q) exp (Pn 6, Pn [One(Q), 6, = 0 


6,W, 


where the summation with respect to s is to be taken over all values for which 
and 6%,(g) is a parameter point for which 


(54) — One(q) | S 1/9. 


Since ¢;;(@) is a uniformly continuous and bounded function of @, it follows 
from (41) and (54) that 


| 0(0, 8, — 8, Ona(g)]| = 


55 * * 
| 9, One(Q)] — [One(q), 8, One(g)]| = 


where ¢:[0, 0%,(¢), One(q)] and ¢2[0, 0%,(¢), One(g)] are bounded functions of 
6, 6%,(q) and 0,,(g) in the domain n/?| @—6,,(g)| $d. From (53) and (55) it fol- 
lows that 


lim sup P[J.(Wa, 8, d, | 0] 
6, n 


(S6) 
Prs(Wa, (Pn 6, | — Pn 9, One(Q) ) = (A, g) 


where 
(57) lim ¢(A, g) = 0 
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uniformly in \ over any finite positive interval. Similarly we obtain 


lim sup {iub. 0, d, | 0] 


see 


> q) exp 6, Ons(q) | Pn 6, ) n(r, q) 


lim 7(A, g) = 0 
q=a 


uniformly in \ over any finite positive interval. In the formulas (56) and (58) 
the summation with respect to s is to be taken for all values for which 
|@—@ne(g)| SA/n*. The expression pn[6n2(g), 9, is obviously equal to 
zero, hence (56) and (58) can be simplified by substituting zero for this ex- 
pression. Denote P[J2(Wn, q)| ] by ). Because of Proposition I 
we have 


lim 9) | — = 0 
uniformly in s and W,. Hence we obtain from condition (b) of (24) 


(60) lim { 9) — = 0 


uniformly in s and W,. Since J5(W,, g) is the intersection of Q,[6,.(q)] with 
Ja(W., g) and since 


lim [One(q)] | One(q)] = 1 


uniformly in s, we have 


(61) lim 9) — = 0 


uniformly in s and W,. Since for any given \"and‘g the"number of different 
values of s satisfying the inequality n'l2|@—0,,(g)| SA isfa bounded function 
of 6, from (56), (58), (60) and (61) we obtain 


lim sup {L.u.b. | P[J.(Wa, 8, d, | 0] — 8, | ]} 
n= 


(62) 
= f(A, q) S + q)- 


Hence 
(63) lim ¢(A, g) = 0 


441 

where 

| 

| 
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uniformly in A over any finite positive interval. 

For any positive \’ the sets J,(Wn, 6, A+X’, g)—JIn( Wa, 8, A, Q), 
W.F(Wa, 8, g) — W.*(Wa, 8, A, g) and W,(0, — W,(8, A) are sub- 
sets of the set defined by the inequality 


A— 1/q SX4+N 1/¢. 
Since for any sequence {X,} for which lim,.. \n= © we have 


lim P[\, — 1/q 0 — 6,| S +’ + 1/q| 0] = 0 


uniformly in 6, g and X’, (39) and (40) hold for any arbitrary sequence {rw} 
and (63) holds uniformly in \ where \ can take any positive value. Thus 
from (30), (39), (40), (62) and (63) we obtain 


(64) lim sup {l.u.b. | P[W.(0, d)| 6] — P[W2(W., 8, », g)| 6] | } = 9), 
a=@ 6,W, 


where 


(65) lim ¢3(d, g) = 0 


uniformly in \. Let {q:} (¢=1, 2, - - -, ad inf.) be a sequence of positive 
integers such that lim... g:= + ©. Furthermore let {ns} be a sequence of 
positive numbers such that lim... 7;=0. We define W,*(W,) as follows: 

(66) Ws(Ws) = for <n S mui (i= 0,1,---, ad inf.). 


The sequence {n,} (¢=0, 1, 2, - - - , ad inf.) of integers is chosen as follows: 
Denote by F,(A, g) the expression 


Lu.b. | P[W.(8, 6] — P[W2(W,, d, 9) | 


The integer mo is put equal to 0 and m; is chosen such that 
ni > Ni-1, 


for all »>mn,;, and where {rx} denotes a sequence of numbers such that 


X= + 
Let A” =x, nf =i, and for ny <n (¢=0, 1, 2, - - , ad inf.). 


Then from (64), (65) and (67) we obtain 


(67) 


(68) lim {L.u.b. | P[W.(@, 6] — P[W(Wa, Qn) | = 0. 
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Denote by W,*(W,, 0, ) the intersection of W,*(W,) and the set defined by 
the inequality n/?|@—6,| <A. Since 0, g) is the intersection of 
W,*(Wa, g) and the region nu2| 6—6,| SX, it follows from (66) that 


(69) W(Wa, 8, Qe) = Wal(Wa, 8, del). 


Equation (20) follows from (68) and (69). Hence Lemma 1 is proved. 

We shall say that a region V,* lies in the space of the maximum likelihood 
estimates if it has the following property: If EZ, is an element of V,* then 
also all those points E,’ for which 6,(E,’)=6,(EZ,) are elements of V,*. In 
all the following considerations the symbol * as a superscript in the notation 
of a region will indicate that the region lies in the space of the maximum 
likelihood estimates, except if a statement to the contrary is explicitly made. 
For any region V,* we shall denote by $( V,*| 0) the probability that the 
sample point will fall in V,* calculated under the assumption that 
6), ---, have a joint normal distribution with zero 
means and covariance matrix ||o;;(0)|| 


LemMA 2. There exists a function W,*(R,*) defined over all Borel measur- 
able subsets R,* such that 
lim 6] — | = 0 


uniformly in 0 and R,*. 

Proof. Since we assumed that the set J,5(W,, g) defined in (24) is an in- 
terval in the space of the maximum likelihood estimates, it follows from 
Proposition I that 
(70) lim {P[Jn(Wa, 9) | 0] — = 0 


uniformly in 6, W,, and s. Let W,*(W,, q) be the set defined in (25) and let 
W.*(Wa, 9, g) be the intersection of W,*(W,, g) and the region 6—6,,| 
<x. For given values of \ and g the number of different values of s, for which 
J(W., q) has at least a point common with the region n/*|@—6,| SX, is 
a bounded function of 6. Hence it follows from (70) that 


(71) lim { P[W2(W., 0, », g)| 0] — BIWa(Wn, 8, g) | = 0 


uniformly in @ and W,. From (64), (65) and (71) we obtain 


(72) lim sup {Lu.b. | P[W.(@, 0] — B[WA(Wa, d, | = eA, 


where 


| 
| 
i 
i 
n=O 
4 
| 
i 
| 
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(73) lim «(A, g) = 0 


uniformly in \. The set W,(@, A) denotes the intersection of W, and the re- 
gion 6—6,| Si. 

Let {qi} (¢=1, 2, ---, ad inf.) be a sequence of positive integers such 
that lim,.. g:= ©. Furthermore let {;} be a sequence of positive numbers 
such that lim,... 7; =0. We define W,*(W,) as follows 


(74) = for mi <n S (i =0,1,2,---, ad inf). 
The sequence {n,;} (¢=0, 1, 2, - - - , ad inf.) of integers is chosen as follows: 
Denote by F,(A, g) the expression 

Lu.b. | P[w.(8, s)| — d, 9) | 6] |. 


The integer #9 is put equal to zero and n; is chosen so that 
> Ni-r, 


for all n>, and {d,} denotes a sequence of numbers such that lim \;= «. 
Let Ad =n and ge for ny <n (¢=0, 1, - - - , ad inf.). From 
(72), (73) and (75) we obtain 


(75) 


(76) lim {I.u.b. | P[W.(@, 6] 9, ge) | 0] |} =O. 


Denote by W,*(W,, 9, X) the intersection of W,*(W,) and the region 
n/?|@—6,| Sd. Because of (74) we obviously have 


9, = WalWry 8, 
Hence from (76) we obtain 


(77) {1.u.b. | P[W.(9, 4’) | 0] — 8, | |} = 0. 


Since lim,.. Ax = ©, it follows from (77) that 
(78) lim {L.u.b. | P[w.|¢] — B[wi(w,) | =0. 
6,W, 


The region W, may be any Borel measurable subset of the rm-dimensional 
sample space. In particular, W, may be any Borel measurable subset R,* in 
the space of the maximum likelihood estimates. Hence Lemma 2 follows from 


(78). 
On the basis of Lemmas 1 and 2 we can restrict ourselves in case of large 


[November 
n=o 


1943] TESTS OF STATISTICAL HYPOTHESES 445 


samples to subsets of the space of the maximum likelihood estimates and we 
can substitute B[V,*|6] for P[V,*|0]. Hence, if the sample is sufficiently 
large, the problem of statistical inference concerning the unknown parameter 
@ can be reduced to the case where the variates involved have a joint normal 
distribution. 

5. Tests of simple hypotheses which have uniformly best average power 
over a family of surfaces. For any value c let K, denote a surface in the 
parameter space. For instance K, may be defined by the equation $(6) =c 
where ¢(6) denotes some analytic function of #. Consider furthermore a non- 
negative function w(@) of 0, called a weight function. For zuy function ¥(6) 
of @ the symbol fx¥(0)dA will denote the surface integral of the function 
¥(0) over the surface K,. 


DEFINITION I. A critical region W,, is said to have uniformly best aver- 
age power with respect to the surfaces K, and the weight function w(@) if 
for any region Z,, of size equal to that of W, we have Sx.P(W,|0)w(0)dA 
=fx-P(Zn|0)w(0)dA for all values c for which K, is defined. 


Let y', - - - , y* be & variates which have a joint normal distribution. The 
mean values 6', - - - , 0* of the variates - - -, y* are unknown, but the 
covariance matrix ||¢;,|| (¢, 7=1, - - -, &) is known and is nonsingular. Sup- 
pose that we wish to test the simple hypothesis that 6=@». Consider the 
family of ellipsoids given by 


(79) — 60) 65] = 


where ||A,,|| =||o;,-. For any c denote by S, the ellipsoid given by (79). Con- 
sider a nonsingular linear transformation of the parameter space 


(80) 0” — 6) = — 6) + — 6) 


such that the family of ellipsoids S, is transformed into a family of concentric 
spheres with the center at #9. Denote by S/ the image of S,. For any point 0 
and for any positive p consider the set w(0, p) consisting of all points 6, which 
lie on the same S, as 6 and for which |6,—6| Sp. Let 
A [w’(6, p) | 

81 6) = lim ————— 
(81) = lim 
where w’(0, p) is the image of w(6, p) and for any set w, A(w) denotes the 
area of w. 


Proposition II. If the variates y', ---, y* have a joint normal distribu- 
tion with unknown mean values 0',---, 0* and a known covariance matrix 
\|o;,||, then for testing the hypothesis 0=0o on the basis of a single observation 
on each of the variates y', - - - , y*, the critical region given by the inequality 
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has uniformly best average power with respect to the surfaces S, defined in (79) 
and the weight function given in (81). 


Proof. Consider the linear transformation 


(83) — = Baly — 00) — 


where the matrix is the same as in (80). The variates ---, 
are normally and independently distributed with mean values 6}, ---, 0 
(under the hypothesis @=@ 9) and have a common variance o?. We will as- 
sume o?=1, since this can always be achieved by multiplying the matrix 
||8:;|| by a proportionality factor. The critical region W given in (82) will be 
transformed into the region W’ given by 

(84) (y= +(x" — 6) 2a. 


Because of (81) we obviously have 
(85) f P(Z| = f | @’)dA, 


where Z denotes an arbitrary region in the space of y’,---, y* and Z’ is 
the image of Z in the space of y’!, - - - , y’*. Hence in order to prove Proposi- 
tion II we have merely to show that 


(86) P(W'| @’)dA = P(2'| @’)dA 


for any region Z’ in the space of y’!, - - - , y’* which has a size equal to that 
of W’. 

By a lemma of Neyman and Pearson(*) we see easily that (86) is proved, 
if we can show that there exists a function d(c) of ¢ such that 


and 
(88) p(y’ | 6’)dA/p(y’ | 00) d(c) outside W’ 


for all positive values of c, where p(y’| 6’) denotes the joint density function 
of y’!, - - - , y’* under the assumption that the true means are - - - , 0’*. 


(*) J. Neyman and E. S. Pearson, Contributions to the theory of testing statistical hypotheses, 
Statistical Research Memoirs vol. 1 (1936). 
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If we denote y’‘—6} by v‘ and 0’*—6§ by 6*‘, we have 


1 
0) = “XP (— — 99) 
and 


1 


Hence 


f. exp (>, — 2-1 >> (0**)2)dA 


= exp (— 24 (9%) exp 


since > ,(6**)? is constant on the surface S/. Hence (87) and (88) can be writ- 
ten 


(89) I(v',---, 0%) = exp >, = d*(c) within W’, 


(90) I(o', ---, v*) S d*(c) outside W’. 


Denote | by 7, and | (6*4)2)¥/2| by r*. On the surface S/ we 
have r*=c. Denote by a(6*) the angle (0SaS7) between the vector v and 
the vector 6*. Then we have 


I(v',---,o*) = f exp (cr, cos [a(6*) ])dA. 


Because of the symmetry of the sphere, the value of this integral will not be 
changed if we substitute 8(0*) for a(@*) where 8(0*) denotes the angle 
(0 $8(@*) S7) between the vector 6* and an arbitrarily chosen fixed vector u. 
Hence I(v!, -- +, v*) depends only on r,, that is I(v',---, v*)=I(r,). The 
inequalities (89) and (90) are obviously proved if we can show that I(r.) 
is a monotonically increasing function of r,. We have 
dI(r, 
(91) f cos [8(6*)] exp (crs cos ]) dA. 
To 8 

Denote by w: the subset of S? in which 0 $8(6*) S7/2, and by w the subset 
in which 27/2 58(6*) Sz. Because of the symmetry of the sphere we obvi- 
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ously have 


f c cos [8(6*) ] exp (cr, cos [8(6*) ])dA 


= f c cos [x — 8(6*)] exp (cr, cos [x — B(6*) ])dA 


=— f c cos [8(6*)] exp (— cr, cos [6(6*) ])dA. 


f cos [8(6*) ] {exp (cr, cos [8(6*) ]) —exp (—cr, cos [8(6*) ])}dA. 


The right-hand side of (93) is positive. Hence Proposition II is proved. 
Now let us turn back to the general problem of r variates x',---, x* 

whose joint probability density function f(x!, ---, x’, 0!,---, 0*)=f(x, 0) 

involves unknown parameters, as considered in the previous sections. 


DEFINITION II. A sequence {W,} (n=1, - - -, ad inf.) of critical regions 
of size a for testing the simple hypothesis 0=0o is said to have asymptotically 
best average power with respect to the family of surfaces K., and the weight func- 
tion w(6) if for any sequence {Z,} for which P(Z,| 60) =a we have 


lim sup | I. P(Z,| 8) dA — P(W,| 6) ia} <0, 


where 


A(K,) = w(0)dA. 


We shall prove the following theorem. 
THEOREM I. Let W,* be a critical region for testing 0=0, defined by the in- 
equality 
n>, (6; = 6) (6% 63) = 
i 


where the real number d, ts chosen so that P( We| 00) =a. Denote by S, the sur- 
face in the parameter space defined by the equation 


 — 00)(6" — 00) = 


Furthermore let £(0) be the weight function as defined in (81) where ||cs;(40)]| as 
substituted for ||X,,||. Then the test { W.*} has asymptotically best average power 
with respect to the family of surfaces S, and the weight function §(@). 


[November 
Hence : 

dI(r) 
93) ——- = 
- 
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Proof. Because of Lemma 1 we can restrict ourselves to subsets of the 
space of the maximum likelihood estimates. Let us assume that Theorem I 
is not true. Then it foliows from Lemma 2 that a sequence of values {ca} 
and a sequence of regions {Z,*} exist such that P(Z,*| 00) =a and 


(94) lim sup 4 — =3>0, 


where 


= £(0) / £(0)aA. 


From (94) it follows that there exists a subsequence {’} of the sequence {n} 
such that 


(95) tim J, | ore J, 2072 loz =5>0. 


It is easy to verify that 

lim | = 1 

R= 
if 0,- is a point of S.,, and if lim,.. 2’C,» = -+ ©. Under the latter condition 
also 


lim | = 1. 
Sens 


Thus (95) can hold only if the sequence { ncn} is bounded. If { n'cn } is 
bounded, we obviously have for any sequence of regions { V,* 


lim { 30% | — f | = 0, 


where P(V,* | 6) denotes the probability of V,* calculated under the assump- 
tion that »/2(6}—61), - - - ,n/2(6%—6*) have a joint normal distribution with 
zero means and covariance matrix equal to ||c;;(9»)||-!. Hence from (95) we 
obtain 


(96) lim { Ii. | — f POW! | =5>0. 


Denote by W,* the region defined by the inequality 


n> (6, — — = da, 


where d, is chosen so that P(W,*| 00) =a. Furthermore denote by Z,* the 
sum of Z,* and the region n/?|@>—6,| Ss, where dq is chosen so that 


= 
Z| 
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P(Z,*| 00) =a. Since, as can easily be seen, 


lim { f POWs: | — Paws: | =0 


* 
lim { f — f Pea | = 0, 
n=@ Scar Sens 
we obtain from (96) a contradiction to Proposition II. Hence Theorem I is 
proved. 

6. Tests of simple hypotheses which have best constant power on a family 
of surfaces. 


DEFINITION III. A critical region W,, for testing 0=0, is said to have uni- 
formly best constant power on the family of surfaces {K.} af the following two 
conditions are fulfilled: 

(a) P( W,| 6,) = P( W,.| 62) for any pair of points 6,, 02 which lie on the same 

surface K-.. 
(b) P(W,|0)2P(Z,/0) for any Z, which satisfies condition (a) and for 
which P(Z,| 00) =P(W,| 00). 

From Proposition II we obtain the following: 

ProOposITION III. Let y',---, y* be k variates which have a joint normal 
distribution with unknown mean values 6',---, 0* and a known covariance 
matrix ||o;;|| =||d«j||-!. Then for testing 0=0o, the region defined in (82) has uni- 
formly best constant power on the surfaces S, defined by the equation 


Since the critical region defined in (82) satisfies condition (a) of Defini- 
tion III, Proposition III is an immediate consequence of Proposition II. 


DEFINITION IV. A sequence of critical regions { W.} for testing 0=0o is 
said to be of size a and to have asymptotically best constant power on the surfaces 
K, tf the following three conditions are fulfilled: 

(a) P(W,| 00) =a (n=1, 2, - - - , ad inf.). 

(b) {I.u.b.. [l.u.b.eex, P(W,| @) —g.l.b.cex, P(W,|0)]} =0, where 
the symbol |.u.b.cex, means that the least upper bound is to be taken with respect 
to 0 restricting 0 to points of Ke. 

(c) For any sequence {Z,} which satisfies (a) and (b) we have 


lim {l.u.b. [P@,| 0) — P(W.| = 0. 
n= 


It is easy to verify that the sequence { W,*} defined in Theorem I satis- 
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fies the conditions (a) and (b) for K.=S., where S, denotes the surface de- 
fined in Theorem I. Thus from Theorem I we obtain the following theorem. 

THeEoreEM II. Let {W,*} and S. be defined as in Theorem 1. For testing 
0=0o, the sequence {w.*} has asymptotically best constant power on the sur- 
faces S.. 

7. Most stringent tests of simple hypotheses. Let @ and 6) be two parame- 
ter points and let a denote a positive number less than 1. We denote by 
P,(@, 90, «) the least upper bound of P(W,| 0) with respect to W,, where W, 
is restricted to regions for which P(W,.| 00) =a. It is clear that if W, is a 
critical region of size a for testing =, its power function can nowhere ex- 
ceed the value of P,(0, 60, aw), that is P( W,| 6) SP,.(8, 90, x) for all values of @. 


DEFINITION V. A critical region W,, ts said to be a most stringent test of the 
hypothesis 0 =0, on the level of significance a if P(W,| 00.) =a and if 


lu.b. [P,(, 0, a) — P(W,| 6)] l.u.b. [P,(0, a) — P(Z,| 
6 


for all regions Z,, for which P(Z,.| 60) =a. 
We shall prove the following proposition. 


PROPOSITION IV. Let y', ---, y* be k variates which have a joint normal 
distribution with unknown mean values 0',---, 6* and a known covariance 
matrix ||o;;|| =||d«,||-!. Then for testing 0=6 the region W defined in (82) is a 
most stringent test. 


Proof. We shall assume that Proposition IV does not hold and we shall 
arrive at a contradiction. If Proposition IV is not true, then there exists a 
region Z in the space of y’, - - - , y* such that P(Z| 60) =a and 


(98) [P(@, 0, a) — P(W| > Lu.b. a) — P(Z| 
6 6 
Let S. be the surface defined by the equation 
i 


The functions P;(9, 00, a) and P( Ww 6) are constant on the surface S,. Hence, 
on account of (98), there exists a value ¢» such that 


P(Z| 6) > P(W| 6) 
for all points @ on S,,. But this is a contradiction to Proposition II. Hence 
Proposition IV is proved. 


DEFINITION VI. A sequence of critical regions {W,} is said to be an asymp- 
totically most stringent test of the hypothesis 0 =O on the level of significance a 
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if P(W,|00) =a and if for any {Z,} for which P(Z,| 6s) =a we have 
lim sup {1.u.b. [P.(0, 00, P(W,| @)] — l.u.b. [P.(0, %, a) — P(Z,| 6)]} <0. 


n= 
We shall prove the following theorem. 


THEOREM III. Let W,* be the region defined in Theorem 1. Then the sequence 
{W,*} is an asymptotically most stringent test of the hypothesis 0 =O. 


Proof. Denote by $, (0, 4, a) the least upper bound of $(Z,*| 6) with re- 
spect to Z,*, where Z,* is restricted to regions in the space of the maximum 
likelihood estimates for which B(Z,*| 00) =a. Because of Lemma 1 we have 
(99) lim [P.(0, 40, «) — Bn(8, G0, = 0 
uniformly in 6. Denote by P,(8, 0, «) the least upper bound of P(Z,*| 6) with 
respect to Z,*, where Z,” is restricted to regions in the space of the maximum 
likelihood estimates for which P(Z,*| 60) =a. The symbol P( V,*| 6) denotes 
the probability of V,* calculated under the assumption that the joint dis- 
tribution of n/2(61—6"), ---, m”2(6%—6*) is normal with zero means and 
covariance matrix ||c;;(@9)||-1. For any positive \ we have 
(100) lim [B,(0, a) — P,(8, 4, «)] = 0 


uniformly in @ in the domain | 2/2(@—o)| SX. Since for any sequence {6,} 


for which lim | 2*/2(0,, —Oo) | =-+ ©, we have 


lim (On, a) = lim P (Ons 6o, a) 1, 


we obtain from (100) 
(101) lim %, a) — P,(0, 0, a)] = 0 


uniformly in 6. For any c let S, be the surface defined by 

j=1 
Obviously P,(6, 40, «) is constant along the surface S,. From (99) and (101) 
we obtain 
(102) lim {l.u.b. 0, — g.l.b. Pa(0, 0, a)} = 0 

0CS8, 

uniformly in @. We shall derive a contradiction from the assumption that 
Theorem III is not true. If Theorem III is not true, there exists a sequence 
{Z,} of regions such that P(Z,| 0) =a and 


— 
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lim sup {1.u.b. [P.(0, 0, a) — P(W*| 


— Lub. [P.(0, 0, a) — P(Z,| 6)]} =5>0. 
On account of (102) and since 


lim [Lu.b. P(Ws| 6) — g.l.b. P(W2| 6)] = 0 

uniformly in c (see Theorem IT), we obtain from (103) that there exists a 

sequence {c,} and a subsequence {n’} of {m} such that for all points 0, 

of 


PZ: | On) > | On) + 8/2 


for all » greater than a certain mo. But this contradicts Theorem I. Hence 
Theorem III is proved. 

8. Definitions of “best” tests of composite hypotheses. In this section we 
shall extend the definitions given in the previous sections to the case of com- 
posite hypotheses. Let w be a subset of the parameter space and denote by 
H,, the hypothesis that the true parameter point is contained in w. In all 
that follows the letter @ printed in boldface will indicate that the parameter 
point lies in w. For example, the symbol |.u.b.¢ f(@) denotes the least upper 
bound of the function f(@) with respect to 6 where 6 is restricted to points of w. 
For any point 6 and for any real value c let K.(6) denote a surface in the pa- 
rameter space. For instance K.(@) may be given by r equations in 0 


6) = --- = ¢,-(0,0) = 0, 
where ¢:(0, - - - , 6) are some analytic functions of and 6. 


DEFINITION VII. A critical region W, for testing H., is said to have uniformly 
best average power with respect to a family of surfaces K.(0) and a weight func- 
tion w(0) if for any Z, for which 


Lu.b. P(Z,|@) = Lu.b. P(W,| 6) 


we have 


P(W,| 0)w(@)dA = P(Z,,| 0) w(6)dA 


for any 0 and for any c for which K.(@) is defined. 


DEFINITION VIII. A sequence (n=1, 2, +--+, ad inf.) of critical re- 
gions for testing the hypothesis H, is said to have asymptotically best average 
power with respect to a family of surfaces K.(0) and a weight function w(@) if 
the following two conditions are fulfilled: 
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(a) There exists a fixed a such that 
Lu.b. P(W,| 0) = a 


(b) For any sequence {Z,} for which |.u.b.e P(Z,| 6) =a, we have 
lim sup [ P(Z,,| @) 4 


PW,| aa |} <0, 


A[K.@)] = w(6)dA. 


DEFINITION IX. A critical region W,, for testing H., is said to have uniformly 
best constant power on the family of surfaces K.(®) if the following two conditions 
are fulfilled: 

(a) P(W,| 0’) =P(W,| 0’) for all pairs of points 0’ and 0’’ which lie on the 
same surface K,.(6). 

(b) P( W,| 6) > P(Z,| 6) for any @ not in w and for any Z, which satisfies 
(a) and the condition 


Lu.b. P(Z,|@) = Lu.b. P(W,| 
8 


DEFINITION X. A sequence of critical regions {W,} for testing H. is said 
to have asymptotically best constant power on the surfaces K.(®) if the following 
three conditions are fulfilled: 

(a) l.u.b.e P(W,| @) =a (n=1,2,---, ad inf.). 

(b) {1.u.b..6 P(W.|6)]} =0. 

(c) For any sequence {Z,} which satisfies (a) and (b) we have 


lim {l.u.b. [P(Z,| — P(W,| = 0, 


where @ is the complement of w. 


DEFINITION XI. Denote by P,(0, w, a) the least upper bound of P(Z,| 6) 
with respect to Z,, subject to the condition |.u.b.6 P(Z,| 0) =a. A critical region W,, 
ts said to be a most stringent test of the hypothesis H., if for some positive a 


Lu.b. P(W,| 0) =a 
8 
and 
L.u.b, [P.(0, w, a) — P(W,| Lu.b. [P.(0, w, a) — P(Z,| @)] 


for all regions Z, for which |.u.b.6 P(Z,|®) =a. 


[November 
where 
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DEFINITION XII. A sequence of critical regions { W,.} is said to be an asymp- 
totically most stringent test of the hypothesis H,, if the following two conditions 


are fulfilled: 
(a) There exists a positive a such that 


Lub. P(W,|0)=a (mn =1,2,---, ad inf.). 


(b) For any sequence {Z,} which satisfies (a) for the same a we have 
lim sup {Lu.b. [P,(0, a) — P(W,| 6)] — Lub. [P.(0,,a) — P(Z,| <0. 


n=o 6 
In Definitions VII—-XII we have formulated the condition 
lu.b. P(W,| 0) = a. 
6 


The question can be raised whether, in place of this condition, the require- 
ment that 

(*) P(W,|0) =a 

for all points @ should be made; or whether the weaker condition that 

(**) lim P(W,|0) = « 


N= 
uniformly in @ should be required. Condition (*) has the serious drawback 
that regions satisfying it do not always exist. Even in cases where (*) can 
be fulfilled, it imposes too strong a restriction on the possible choice of W,, 
which does not seem to be quite justified. It is conceivable that in some cases 
a region W,’ may exist which does not satisfy (*) but has such an advanta- 
geous power function that we prefer it to any region W, which satisfies(*). 

As to the condition (**), we shall see that it is satisfied for the sequence 
{17,.} which is shown in this paper to be asymptotically best according to all 
three definitions VIII, X and XII. Hence the same sequence { W,} remains 
asymptotically best if we replace the condition l.u.b.e P( W,| 6) =a by (**) in 
the definitions VIII, X and XII. 

In the following §§9-11 we shall discuss a linear hypothesis of the follow- 
ing type: 6 =05 (r<k), where 0,---, 05 are some specified 
values. That is to say, the set w is the set of all points @ for which the above 
equations hold. In §12 the general composite hypothesis will be discussed. 

9. Tests of linear composite hypotheses which have uniformly best 
average power over a family of surfaces. Let H. be the hypothesis that 
eo =61,---+, 6°=65 (r<k). We shall introduce the following notation: For 
any parameter point 0=(6!, - - - , @*) the symbol 6 will denote the vector 
in the r-dimensional space with the components 6, - - - , 6’, and 26 will de- 
note the vector in the k—r dimensional space with the components 
Or+1, ..-, 6. For any function (6) of @ we shall use the synonymous nota- 
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tion ¥(,0, 2). For instance P( W| 19, 28) is synonymous with P( W| 6). 

Let y',- ++, y* be & variates which have a joint normal distribution 
with unknown mean values @!,-+-, 6@* and known covariance matrix 
=1,---, &), which is nonsingular. Denote by W the 
region in the space of y', - - - , y* given by the inequality 


(104) XX Koay” — — 00) 
q=l 
where ||Xpql| (2, g=1, - +--+, 7). Consider the nonsingular linear 
transformation of the variates y', - - - , y* given by the equations 
0 = Byly — 60) 6) (P= 
yt =yay + (¢=r+1,---,&), 


such that y’!,---, y’* are independently distributed with unit variances. 
Denote by S.(@) the surface given by the equations 


— — 6s) =<, 
(106) pap > Pq o)( 
+ + = (@=r+i1,---, &). 


t=1 


(105) 


Consider the transformation of the parameter space given by 


— = Byi(O — 6) + — 6) (p= 1,--+,”), 
Ot = + (¢=r+i1,---,&), 


where the coefficients 8,, and y,; are the same as in (105). The transforma- 
tion (107) transforms the surface S.(@) into a sphere S (6) given by 


(107) 


r k 
p=1 i=1 


For any point @ and for any positive p consider the set w(0, p) consisting of 

all points *@ which lie on the same S,(6) as 0, and for which | *9—0| Sp. Let 
A |w’ (6, 

(109) £0) = tim 

e=0 A[w(8, p) 

where w’(0, p) is the image of w(@, p) (by transformation (107)) and, for any 

set w, A(w) denotes the (r—1)-dimensional area of w. We shall prove the 

following proposition. 


ProposiTIOn V. Let y',---, y* be k variates which have a joint normal 


distribution with unknown mean values 0', - - - , 0* and known covariance ma- 
trix ||o%;||=||d,||-!. For testing the hypothesis 9=105 on the basis of a single 
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observation on each of the variates y',---, y*, the critical region W given in 
(104) has uniformly best average power with respect to the family of surfaces 
S.(0) defined in (106) and the weight function £(6) given in (109). 


Proof. Because of (109) we have 
(110) f P(Z| 0)£(0)dA = f P(Z' | @’)aA, 


where Z denotes an arbitrary region in the space of y',---, y* and Z’ is 
the image of Z by transformation (105). The region W is transformed into 
W’ given by 

1 a r 
(111) (y" 6) 24. 


In order to prove Proposition V we have merely to show that 


(112) f P(W’| = f P(Z' | 0’)dA 
s2(@) 


for any c>0, for any ®, and for any region Z’ in the space of y’!,---, y’* 
for which I.u.b. P(Z'| 180, 20’) =1.u.b. P(W’| 100, 28’). For any point 6’ of 
Si (6) we have 26’ =20’. By a lemma of Neyman and Pearson, (112) is proved 
if we can show that there exists a function d(c) such that 


p(y", +, 16’, = d(c) within W’, 


(113) 


| 190, 2’) s d(c) outside W’ 


for all values of c and @ where p(y"!, - - - , y’*|@”) denotes the joint probability 
density of y’!, ---, y’* under the assumption that 6’ is the truefparameter 
point. Obviously 
Hence (113) is equivalent to 
---, = d(c) within W’, 
p(y’, 100) S d(c) outside W’. 
The proof of (114) is omitted, since it is the same as that of the inequalities 


(87) and (88). Hence Proposition V is proved. 
Let the critical region W,,* be defined by the following inequality 


(114) 


(115) nd — 02)(65 — dm 


q=1 p=1 
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=1,----,). The constant d, is chosen so that 
lu.b. P(W*| 0) = a. 
6 


Let Z}(0) =n2(66—0*) (¢=1, +--+, and consider the nonsingular linear 
transformation 


Z2(0) = Byr(9)Zn(8) + + Byr(0)Z0(6) (p = 1,---,n), 
Z.(0) = + + @=r+1,---,2, 


such that Z1(0), - --, Z%(@) would be independently distributed with unit 
variances if the covariance matrix of Z1(6),---, Zi(@) were given by 
l|o;;(0)|]. Denote by S.(6) the surface defined by the equations 


(116) 


(117) (0 — — = 


For any positive 6 denote by S; the set of all points @ for which 
|e” —68| (p=1,---, We shall prove the existence of a positive 6 
such that for any point @ in S; there exists exactly one surface S,(6), that is 
exactly one value of c and exactly one point @, such that @ lies on the surface 
S.(6). This statement is obviously proved if we show that for any point @ in 
S; the set of k—r equations 
(118) — +--+ + ¥2(0)(0* — 0*) = 0 (@=r+1,---,&) 
has a’ unique solution in the unknowns @*+, - - - , @*. From the definition 
of the quantities 8,,(@) and .:(@) it follows that 
(119) = 7, 


where A (6) denotes the matrix 


B1:(8) Bi2(6) + Bir(0) 
B12(0) - Bar(6) 


Br1(0) B,2(0) (0) 0 0 


A(®) is the transposed of A(@) and J denotes the unit matrix. Since ;;(@) 
is a continuous and bounded function of 6 and since the determinant | ¢;;(@)| 
has’a positive lower bound, we find that 8,,(@) and 7;;(@) are continuous and 
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bounded functions of @ and that the absolute value of the determinant A (6) 
has a positive lower bound. Hence also the absolute value of the determinant 


(121) 7) = 


has a positive lower bound. 

Let 6=6* where 6* denotes an arbitrary point of w. Then, since the de- 
terminant in (121) has a positive lower bound, the equations (118) have a 
unique solution in the unknowns @t', - - - , 6*, namely the solution 6=6*. 
Furthermore we see that the Jacobian of the equations (118), taken at the 
point 6=6*, is equal to (6*). Since the absolute value of y(@*) has a positive 
lower bound, there exists a positive 6 such that the equations (118) have a 
unique solution in 6 if |@—6*| <8. This proves the existence of a positive 6 
such that for any point @ in S; there exists exactly one surface S,(@) such that 
6 lies on S,(6). 

Since for the critical region W,* defined in (115) we obviously have 
lim, P(W,*|0)=1 uniformly over the domain | 6?—62| 26, we shall restrict 
ourselves to the consideration of points 6 for which | P—8 | Sé5(p=1,---,7r). 

Consider the transformation of the parameter space given by 


= + + =: ¢ + 1, k), 


where 6 denotes the point for which 6 lies on S,(@). The transformation (122) 
transforms S,(@) into the sphere given by 


(122) 


p=1 


We define a weight function &(0) as follows: 


A [w’(@, 


(124) = lim Aloe, 


where the symbols on the right-hand side have the same meaning as in (109). 


THEOREM IV. Let the critical region W,* for testing ,9=10o be the region 
defined in (115). Furthermore, let S.(@) be the surface defined in (117) and let 
£(0) be the weight function defined in (124). Then {W,*} has asymptotically 
best average power with respect to the family of surfaces S.(0) and the weight 
function &(@). 

Proof. Because of Lemma 1, we can restrict ourselves to subsets of the 
space of the maximum likelihood estimates. Because of Lemma 2, Theo- 


| 
| 
} 
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rem IV is proved if we show that for any {Z,*} for which l.u.b.e P(Z,*| 6) =a 
we have 


lim sup {iu.b.[ | [ | <0, 
s-(0) s-(@) 


where 


$n(0) = £(6) £(0)dA. 


If Theorem IV were not true, there would exist a sequence {c,}, a se- 
quence {6,}, a sequence {Z,*}, and a subsequence {n’} of {} such that 


Lu.b. 0) = a, 
6 


i { f | — | =5>0. 
n= 0 Sens (On) Sens Ons) 


It is easy to verify that for any sequence {c,} for which lim nc,=+© we 
have Sse. B(W,?| 6)¢,(@)dA =1 uniformly in 6. Hence (125) can hold only 
if the sequence {’c,-} is bounded. If {m’c,-} is bounded, for any sequence of 
regions { V,*} we obviously have 


a= at On, 


where Po( V,*| 0) denotes the probability of V,* calculated under the assump- 
tion that 2/2(6,—6"), - - - , m/2(6%—@*) have a joint normal distribution with 
zero means and covariance matrix ¢;;(6). Let W,*(@) be the region defined by 


>> (6, — — dn. 


q=1 p=l 


It is clear that if {n’c,-} is bounded, we have 
(127) lim { | 6) — Py[W2@)| =0 


uniformly in @ and 6 over the domain in which @ is a point of S,,-(@). From 
(125), (126) and (127) we obtain 


{ f 9, Zn" | 
Sens On) 
I 0,1 On’) | =6>0. 


Sens 


The surface S.(6) defined in (106) is identical with the surface S.(@) defined 


lim 
(128) 
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in (117) if in (106) we substitute c;;(6) for \;;, that is we substitute ¢,,(6) 
for Xp. Similarly, if we substitute c;;(@) for 4; for any point of the surface 
S.(6), the value of the weight function £&(@) defined in (109) is the same as 
the value of £(6) defined in (124). Hence, since 


lim [l.u.b. | = a, 
6 


equation (128) is in contradiction to Proposition V. Thus Theorem IV is 
proved. 

10. Tests of linear composite hypotheses which have best constant power 
on a family of surfaces. The critical region W defined in (104) satisfies condi- 
tion (a) of Definition IX if K,.(@) is equal to S.(@) given in (106). Hence from 
Proposition V we obtain the following proposition. 


Proposition VI. The region W given in (104) has untformly best constant 
power on the surfaces S.(@) defined in (106). 


If W,* is the region defined in (115) and if K.(@) is equal to the surface 
S.(@) defined in (117), then { W,*} satisfies conditions (a) and (b) of Defini- 
tion X. Hence, from Theorem IV we easily obtain the following theorem. 


THEOREM V. Let W,* be the region defined in (115) and let S.(@) be the sur- 
face defined in (117), then for testing 10 =100, { W,.*} has asymptotically best con- 
stant power along the surfaces S.(@). 


11. Most stringent tests of linear composite hypotheses. We shall prove 
the following proposition. 


Proposition VII. Let y’, -- +, y* be k variates which have a joint normal 
distribution with unknown mean values 61, - - - , 0* and known covariance ma- 
trix ||os;||=||d«,||—". For testing the hypothesis 19=10o on the basis of a single 
observation on each of the variates y', - - - , y*, the region W given in (104) is a 
most stringent test. 


Proof. First we shall show that P(@, w, a) is constant along S.(@) where 
S.(@) is defined in (106). Consider a linear transformation of y', ---, y* as 
defined in (105). Then the transformed variates y’!, - - - , y’* are independ- 
ently distributed with unit variances. Denote by 6’ the image of @ obtained by 
the transformation (107) and let P’(0’, w, a) be equal to l.u.b. P(Z | 6’) with 
respect to Z’, where Z’ may be any region in the space of y”!, - - - , y’* sub- 
ject to the condition that 


l.u.b. P(2' | 190, 20’) = a. 
28 


Obviously P’(6’, w, a) = P(6, w, a). 


| 
} 
| 
{ 
| 
| 
| 
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Hence we have merely to show that P’(0’, w, a) is constant along S/ {@) where 
Si (0) is the image of S,(0) and is given by 


p=1 


Let P*(0’, 100, a) be equal to the least upper bound of P(Z’| 0’) with respect 
to all regions Z’ in the space of y”!, - - - , y’* for which P(Z’| 160, 26’) =a. Ob- 
viously P*(0’, 100, ~) =P’(0’, w, a). It is easy to verify that the region V’ for 
which P(V’| 10’, 26’) =P*(0’, 190, a) is a subset in the space of - 
Hence 
P(v’| 19, 291) = 190, 292 ) 
for any pair of points @{ and 64, and therefore 
190, a) Pe, a) P(6, a). 


Since P*(0’, 190, a) is constant along S.(@), our statement is proved. From 
this and Proposition V, Proposition VII easily follows. 


THEOREM VI. Let W,* be the region defined in (115). Then {W,*} is an 
asymptotically most stringent test of the hypothesis 10 =10o. 

Proof. Denote by $,(8, w, a) the least upper bound of $(Z,*| 0) with re- 
spect to Z,*, where Z,* is restricted to regions in the space of the maximum 
likelihood estimates for which 


Lu.b. B(Z,| 6) = a. 
6 


On account of Lemmas 1 and 2 we have 


(129) lim { P,(0, — Ba(0, «)} = 0 


uniformly in 6. 

Denote by P,(8, 190, a) the least upper bound of P,(Z,*|6) with respect 
to Z,* where Z,* is restricted to regions in the space of the maximum likeli- 
hood estimates for which 


l.u.b. | 190, 20) = a. 
? 


The symbol P( V,*| 6) denotes the probability of V,* calculated under the as- 
sumption that the joint distribution of - - - , is nor- 
mal with zero means and covariance matrix ||c;,(@)||-!, where @ denotes that 
point for which 6 lies on the surface S.(@) defined in (117). It can be shown 
that for any positive \ we have 


(130) lim {B,(8, w, a) — 180, a)} =0 


n=o 
n= 
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uniformly in @ in the domain | 10 — 10o| S/n", Since for any sequence { 6n} 


for which 


lim @, a) = lim 190, a) = 1, 


we obtain from (130) 
(131) lim {B,(0, w, a) — 100, «)} = 0 


uniformly in 6. The function P,(6, 190, a) is constant along the surface S.(@) 
defined in (117). This can be proved in the same way as the constancy of 
P(0, w, a) on S,(6) defined in (106). Hence from (129) and (131) we obtain 


(132) Tim ub. P,(0, w, a) — P, (8, w, «)} = 0 


uniformly in @ and c. According to Theorem V we have 
(133) lim { Lu.b. P(W%| 6) P(W|0)} =0 
a=@ 8.(6) 8.(0) 


uniformly in c and 6. Theorem VI follows from (132), (133) and Theorem IV. 

12. The general composite hypothesis. In §§9-11 we have considered the 
linear composite hypothesis ,0 =149. Now we shall discuss a general composite 
hypothesis H, where w denotes a subset of the parameter space given by r 
equations 


(134) = (0) = --- = = 0 (r < &), 


.that is, w is the set of all points @ which satisfy equations (134). We make the 
following assumption. 


AssuMPTION VI. There exist k—r functions §*+1(0), - - - , £*(0) such that the 
following three conditions are fulfilled: 

(a) The transformation which transforms the point 0 into the point & with 
the coordinates §'(0), - - - , *(0) ts a topological transformation of Q into itself. 

(b) The first and second order partial derivatives of £'(0),--+-, &*(0) are 
uniformly continuous and bounded functions of 0. 

() The greatest — bound of the absolute value of the Jacobian 
-- - , &)/0(6", - - - , O*) ts positive. 


Let &=(é', ---, &) denote a variable point of the parameter space 2. 


we have 
n=o 
n= 

| 
| 
| 
| 
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Since according to Assumption VI the transformation 

(135) = 

is topological, we can solve the equations (135) and we obtain 

(136) = , (¢=1,---, &). 


From conditions (b) and (c) of Assumption VI it follows that the first 
and second order derivatives of 0*(é',---, & ) are uniformly continu- 
ous and bounded functions of & and the absolute value of the Jacobian 
0(0', -- 0*)/a(é', - - - , has a positive lower bound. 

Let f*(x, —) be the probability density function we obtain from the proba- 
bility density function f(x, 0) of x by substituting the right-hand side of (136) 
for 6‘. Hence f*(x, £) is the probability density function of x in the trans- 
formed parameter space. It is clear that the maximum likelihood estimate 
of £‘ is equal to #=£*(61, - - - , 6), where 6, is the maximum likelihood esti- 
mate of @. 

Denote by I*, II*, - - - , V* the assumptions which we obtain from As- 
sumptions I-V respectively by substituting f*(x, £) for f(x, 0), & for @ and 
£, for 6,. We shall show that Assumptions I *-V* can be derived from Assump- 
tions I-VI. 

Assumption I* is an immediate consequence of Assumptions I and VI. 
Since according to Assumption VI the first derivatives of (6) are continu- 
ous and bounded functions of 0, the transformation (135) is uniformly con- 
tinuous. Hence, for each positive e* there exists a positive ¢ such that the 
inequality | 6, —6| Se implies the inequality |én—é| Se*. From this and As- 
sumption II we obtain Assumption II*. 

Denote by »;(x, 6:, 5) the least upper bound, and by p;(x, 4, 6) the great- 
est lower bound of @ log f(x, @)/00* in the interval 6,—6 50 56,+5. Using the 
Taylor expansion we obtain 


(137) 


log f(x, 61) log f(x, + ot! 0’) 8? log f(x, 81) 


06; 0030? 


where §, lies in the interval [6,, 6;*]. From (137) it follows that 


A log f(x, 61) log f(x, 8 
(138) S 25D [| bis(x, 0, 8) | +] 6) |] 


for any @ and 6 for which 
6—-856,50+5 and 0-556; S0+6. 


From (138) we obtain for any positive 6 
(139) 0, 8) — 8, 8) | S [| 6, 8) | +| @, 8) |]. 
i 
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Let {@:.} and (m=1, 2, +--+, ad inf.) be two sequences of parameter 
points such that 


(140) lim (01, — 92n) = 0. 

According to Assumption III the expectations Eo,[¢i;(x, 62, 5)]? and 
Ez, [Wis(x, 62, 5) |]? are bounded functions of 6, 6, and 6 in the domain D, de- 
fined in Assumption III. Hence also the expectations Eo,|¢i;(x, 62, 5)| and 
6s, are bounded functions of 6;, and 6 in the domain D,. 
From this and relations (139) and (140) it follows that for any sequence { 6,} 
of positive numbers for which lim,.. 5, =0 we have 


(141) lim { Ben, 5n) Bon, 5,)} 0. 


Since 
» Gon 
Ban, 5n) s s v(x, Ben, 5n) 


it follows from (141) that 


06 


in { Den; bn) En, 


(142) 


lim { Bon, 5n) Ean 0. 


n=o 
Using the Taylor expansion we have 


8 log f(x, 9 log f(x, 
06% 


8? log f(x, 


(143) + — Gin) 


where 6, lies in the interval [6:., 02.]. Since the expectations Ep,|@:;(x, 62, 5)| 


and Es,| Wiilx, 62, 5)| are bounded functions of @;, #2, and 6 in the domain D,, 


we obtain 


lim X — 


Hence it follows from (143) and Assumption IV that 


(144) lim Ea, = lim bg Ain) _ 


We obtain from (142) and (144) 
(145) lim Bon, 5,) = lim x, Ben, bn) = 0. 
Denote by ¢%(x, £, 5*) the least upper bound, and by yj(x, &, 5*) the great- 


0. 


i 
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est lower bound of 0? log f*(x, in the interval SéS£,+5*. We 
have 


8? log f*(x, &) -x 8? log f(x, 6) 

log f(x, 0) 376! 

7 a6! 


Since 


log f(x, 
= 0, 
Es 
we obtain from (146) 


8? log f*(x, > log f(x, 90! 


47 E 


Hence the determinant 

log f*(x, &) log f(x, 6) | /A(@,--- , 6*)\? 

Since the determinant | — Ed? log f(x, 6) /80*36'| has a positive lower bound. 
it follows from (148) and Assumption VI that 


(149) |— log f*(x, | 
has a positive lower bound. 

For any positive 5* let 5(5*) be the smallest positive number such that 
for any two points & and & for which | #—é|<5* we have |@;—62| <4(*) 
where 6, and 6, are the image points of & and £ by transformation (136). 
From (146) we obtain 


(148) E; 


oo™ 


oe” 


Visa, é, *) Lvinl x, 6, 


9, 

where @ is the image point of ~ by transformation (136), and the derivatives 
06'/d£‘, ‘OE! are taken at some points in the interval [é—6*, £+6*], 
and the functions dim(x, 0, 5), Wim(x, 6, 5), 6, 5) and ¥,(x, 0, 5) satisfy the 
inequalities 


(150) 
‘ 
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Vim(%, 8, 5) S dim(x, 9, 5) S dim(x, 8, 8); 
Vim(x, 0, 8) S Pim(x, 9, 8) S 8, 8); 
0, 6) S 0, 8) S 8, 8); 
ui(x, 0, 6) S 6, 8) S 8, 8). 


Let {tn} and {£2} be two sequences of parameter points such that 
lim £:.=lim Let be the image point of 42, the image point 
of &2,, and @ the image point of & (by transformation (136)). Let furthermore 
{5,*} be a sequence of positive numbers such that lim 5,*=0. Then we ob- 
viously have lim 6(6,*) =0 and therefore using (145) and Assumption III we 
obtain 

lim = log f(x, 6)/20'30™, 


lim Es,,¥1m[2, = Eyd? log f(x, 6)/30'00™, 


lim [x, an, 5(52)] = 0, 


lim [x, = 0 


uniformly in @. From (150) and (151) and the uniform continuity of the deriv- 
atives 00'/0£* and we obtain 

0? lo 
(152) lim dn) = lim bx) = 


uniformly in &. 
Because of (150) we have both 


| & 8°)| and | & 8°) | 
s Vim [x, 0, | 


+ | dim|x, 9, 5(6*) ] Ee | } 


+ vi[x, 0, | +| mele, 6, |] } 


where the least upper bound with respect to é is to be taken over the interval 


+-5*]. 
We shall show that Es, [vs(x, 6, 5) }? and Ea, [us(x, Oe, 5) }? are bounded func- 
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tions of 6,, 6. and 6 in the domain D, for sufficiently small ¢. Our statement is 
proved if we show that Eg, [0 log f(x, 6:)/00*]?, Ee, [vi(x, 02, 5) log f(x, 
and Ep, [u:(x, 62, 5) —d log f(x, 6:)/0*]? are bounded in D,. The first of these 
expressions is bounded because of Assumption V. From (138) it follows that 


l 
vi(x, 02, 8) — | WE [| 5) | + | 2, 5’) |] 


and 


ui(x, 02, 8) — < [| bis(x, 8’) | + | 8’) |] 


where 5’=5+)_;|6{—63|. From the above inequalities and the fact that 
Eo, [bis(x, 62, 5) ]? and Es, 62, 6) ]? are bounded in D, it follows that for 
sufficiently small the expressions 02, 5)—@ log f(x, 0:)/80*]? and 
Eo, [pi(x, 02, 6) log f(x, 6:)/804]? are bounded in D,. Hence our statement 
is proved. 

Since the derivatives and are bounded functions of 
and since Eo, [¢i;(x, 62, 5) ]* and Eo, [Wi;(x, 2, 6) ]? are bounded functions of 
6,, 62 and 6 in the domain D,, it follows from (153) that there exists a positive 
e* such that , 5*) ]? and Eg, 5*)}* are bounded functions 
of £, & and 5* in the doniain defined by | t:—é| Se* and | 5*| Se*. Assump- 
tion III* follows from the latter statement and the relations (152) and (149). 

Assumption IV* is an immediate consequence of Assumption IV. 


We have 


log f*(x, &) 8 log f(x, 6) 


For any points x and @ denote the maximum of the & expressions 
log f(x, 6) /d6"|, log f(x, 0) by p(x, 0). From Assumption V 
it follows easily that 

(155) E,[o(x, 6) 


is a bounded function of @. Since the derivatives 064/d¢* are bounded func- 


tions of £, Assumption V* follows from (154) and (155). 
Denote by cj(@) the function of @ we obtain from — E,0? log f*(x, £)/d£E# 


by substituting £(@) for & Then we obtain from (147) 
* 06! =. 


Denote by A the matrix ||00*/d€4| (i, 7=1, - - - , &) and let Z be the trans- 


= 
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posed of A. Then equation (156) can be written as 
(157) = 


Let ||o%(6)|| be the inverse of the matrix ||c¥(@) ||. Futhermore let B be the 
matrix ||d¢*/00‘|| (i, j7=1, ---, &) and denote by B the transposed of B. 
Since B=A-, we obtain from (157) 


(158) = Blloi(||B, 
where ||o;;(8)|| =||c:;(8)||-?. Equation (158) can be written as 


‘ 


Let 
Denote by W,* the critical region defined by the inequality 


g=1 p=1 


where the constant d, is chosen so that 
Lu.b. P(W*| 6) = a. 


The point 6 is restricted to points of the set w defined by equations (134). 
For each positive c and for each point 6 of w we define the surface S.(@) by 


the equations 
X DX = 

(162) 

DX = +1,---, 2), 

j=1 
where the coefficients 7;;(@) satisfy the following condition: There exists a 
matrix ||8,¢(6)|| g=1, - - - , 7) such that if we form the matrix A(6) given 
in (120) then 
(163) = 1, 
where A(6) denotes the transposed of A(@) and J denotes the unit matrix. 

Consider the transformation of the parameter space given by 
= Byi(O)E*(0) + - - + Bor (O)E"(0) (p= 1,---,7), 


(164) 
= + + (@=r+1,---,k), 
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where @ denotes the point of w for which @ lies on S.(6) for some value of c. 
The matrix ||8,,(0)|| is chosen so that (163) is fulfilled. The transformation 
(164) transforms S,(6) into the sphere S/ (6) given by 


r k 


We define a weight function 7(@) as follows: 
(165) = lim A[w’(, p) ]/A [w(6, |, 


where the symbols on the right-hand side of (165) are defined as in (109). 
Since Assumptions I*-V* are fulfilled if Assumption VI holds, we obtain 
from Theorems IV, V and VI the following theorem. 


THEOREM VII. Let W,* be the region defined in (161) and let S.(6) be the 
surface defined in (162). Furthermore let n(@) be the weight function defined in 
(165). If Assumption VI holds, then for testing the hypothesis §'(0)= -- - 
= =0 the sequence { W,*} 

(a) has asymptotically best average power with respect to the surfaces S.(8) 
and the weight function (6); 

(b) has asymptotically best constant power on the surfaces S.(0); 

(c) ts an asymptotically most stringent test. 


13. Optimum properties of the likelihood ratio test. For testing a com- 
posite hypothesis H,, Neyman and Pearson introduced a statistic(®), called 
likelihood ratio, defined as follows: The density function in the sample space 
is given by []%_.f(xe, Denote by P(x, the maximum of this 
function with respect to 6',---, and let be the condi- 
tional maximum with respect to 6', - - - , #*, subject to the condition that 0 
must be a point of w. Then the likelihood ratio for testing the hypothesis H, 
is given by Aa(w, En) = Palms, xn)/ -- Xn). It is obvious that the 
value of A,(w, Z,) always lies between 0 and 1. Neyman and Pearson recom- 
mend the use of the left tail as critical region, that is the hypothesis H, is 
rejected if the value of \,(w, E,) is less than a certain constant \,(w). Denote 
the region \,(w, E,) <\n(w) by L,(w). In all that follows we choose the con- 
stant \,(w) so that 


Lab. P[L,(w) | 6] = a. 
We shall prove that there exists a finite value B such that —2 log \a(w) <B 
for all » and for all w. Consider the Taylor expansion 


(°) See in this connection J. Neyman and E. S. Pearson, On the use and inierpretation of 
certain tes! criteria for purposes of statistical inference, Biometrika vol. 20A (1928). 
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log f(%a, 6) = log f(%a; 9x) 
1 é 7 i 3? log 6,,) 1 


n 


where 6, lies in the interval [6,, @]. Since 
log b,) — log f(xa, 0) — log ra(w, En), 


we have 


n 00*00/ 


a 


Since log f(xa, 5,)/80‘004 converges stochastically to —c;;(6) under 
the assumption that @ is the true parameter point, it follows easily from 
Proposition I that for any e>0 there exists a positive value A(e) such that 


for any w 


= |o]\ se 


and lim... A(e) = + ©. Hence 
(166) lim sup { Lub. P[— 2 log = A()|0]} Se. 


This proves the existence of a finite number B with the required property. 

For any subsets I and I’ of the parameter space we denote by 4(T, I’) 
the greatest lower bound of the distance between 6 and @’ where @ is restricted 
to points of I’ and @’ is restricted to points of I’. We shall call 6(T', I'’) the 
distance of the sets T' and I’. 

Let {6} be a sequence of parameter points such that lim 6(0,, w) =0 and 
lim,e. #'/25(0,, w) = + ©. We shall prove that there exists a positive vp such 
that for any constant A 


(167) lim > log 4.) — log 0*) > A | | = 1, 


where 6* denotes a point of w for which |@*—0,| Sv» and log f(x.a, 
log f(xa, ®) for all @ in the domain |@—0,| Consider the Taylor 


expansion 
log 6*) — log f(%a, 6,) 


(168) 8? log On) 


n ¢ * i * 1 
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where 6, lies in the interval [6,, 6*]. Because of conditions (a) and (b) of 
Assumption III for any e>0 the positive number v9 can be chosen so that 


8? log f(xa, On) 


= 1. 


1 
(169) lim P| + — 
n=@ n a 

Since ||c;;(@)|| is positive definite and the determinant | c;;(@)| has a positive 
lower bound, we obtain from Proposition I, (168) and (169) that for some 
positive, vo, (167) holds. Hence our statement is proved. 

We say that the likelihood ratio test is uniformly consistent if for any 
positive v 

lim P[L,(w) | = 1 

uniformly in w and @ over the domain 6(@, w) =v. We postulate the following 
assumption. 

AssumpTION VII. The likelihood ratio test is uniformly consistent. 

This assumption together with the uniform consistency of the maximum 
likelihood estimate 6, will be proved in a forthcoming paper on the basis 
of some weak assumptions on the density function f(x, @). 


Let w,(@) be the intersection of w with the set of all points 6’ for which 
Di-1|0‘—6’*| =v. From Assumption VII it follows that for any positive v 


(170) lim P{ — 2 log An[w,(8), En] = — 2 log 0} = 1 


uniformly in @. 

Let be a sequence of parameter points such that lim... 5(0,, =0 
and lim,... 2"/25(0,, w) = + ©. Denote by w, the set of all points 6 for which 
|@—0,| Svo. Since —2 log X, [w,(8) ] has a finite upper bound it follows from 
(167) that for a sufficiently small vo 


(171) lim P{ — 2 log Aalwa, En] = — 2 log X,[w,,(0n)]| On} = 1. 


Obviously 

— 2 log d,(w, En) = minimum { — 2 log dn[w,,(8n), En], — 2 log a(n, En) }. 
From (170) and (171) we obtain 
(172) lim P{ — 2 log \a(w, En) = — 2 log On} = 1. 


Since —2 log Ax [w,,(8n), En] 2 —2 log An(w, En), we have —2 log Xn [w,,(0.) | 
= —2 log X,(w). Hence from (172) we obtain 


(173) lim P[L,() | @,] = 1. 
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From Assumption VII it follows that (173) holds for any sequence {6,} for 
which lim /?6(6,, w)= +. 

Now let us consider the case where w is the set given by 10=10o. Let {0,} 
be a sequence of parameter points for which the sequence 1/*5(,, w) is 
bounded. Denote by 6,(w) the parameter point in w for which 


II 6,(w) - P.(%1, 


Let 6, be the point for which 5(6,, w) = 5(0,, 6,). Denote by T,(v) the region 
given by the inequalities 


| 6, — Op | 


an | 6,(w) — 6, | <p. 


We shall prove that 
(175) lim P[T,(v) | @,] = 1 for any v > 0. 


Consider the Taylor expansion 


log = > log f(a, 6n) 


+2 


(176) 8? log f(%ay On) 


Since log converges stochastically to zero, 
and since n'/?|@{—@{| is bounded, we easily obtain from Proposition I that 
for any e>0 there exists a constant B, such that 


lim sup P[ > log On) — >> log f(xa, = B.| =. 


Since —2 log Aa(w, En) log f(xa, On) log f(xa, n) we have 
(177) lim sup P[— 2 log An(w, En) 2 2B.| On] S €. 


Denote by w, the subset of w in which |@—@,| 2v. From Assumption VII 
it follows that 


(178) lim P[— 2 log Aa(wn, En) = — 2 log X,(wn) | 0.) = 1. 


Since —2 log X,(w) S —2 log X,(wa) we have 
(179) lim P[— 2 log An(wa, En) = — 2 log Xq(w) | 0,4] = 1. 


For any given constant B there exists a positive a<1 such that —2 log X,(w) 
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2B if l.u.b.e P[L,.(w)| @] =a. Hence from (179) we obtain 
(180) lim P[— 2 log En) = Bl @,] = 1 


for any constant B. From (180) it follows that 
(181) lim {P[- 2 log Aa(w, E,) = B| On | P[| 6, (w) — 6,| = 20 


for arbitrary values of B. From (181) and (177) we obtain 
lim P[| 6.(w) — 6. | 2 »| @,] = 0. 


Hence (175) is proved. 
Consider the Taylor expansion 


log f(%a; On) = log 6,(w) 


(182) 


where 6, lies in the interval [0,, 6,(w) ]. In the following arguments we shall 
use the following lemma: Let ||d,;|| (é,7=1, - - - , &) be a definite matrix and 
for each integer s let Ni, be a real number such that lim... y=di;- Then 


lim vw di) = 1 


uniformly in v;, + + + , ¥». From (175), the Taylor expansions (176) and (182), 
and the above lemma it follows that for any e>0 


lim P[(1 + — (1 — €)Gn — 2 log An(w, 


183) 
( ) = (1 — €)Gn — (1 + €)gn| On] = 1, 


where 


(184) qn = } n(bn 6.) (6, 


and 


k k F ; 
(185) Gn = Dd — [6,(w) — 


r+1 r+1 


Since —2 log \,(w, Z,) 20, we obtain from (183) 
(186) lim P[(1 + gn — (1 — gn = 0| 0,] = 1. 
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Since the sequence {n/26(6,, 6.) } is bounded, the expression g, is bounded 
in the probability sense, that is for each positive p there exists a positive 
value A, such that 


lim > A,| On) 


From (186) it follows that @, is also bounded in the probability sense. Hence 
because of (183) we have 


(187) lim P[qn — Gn + € 2 — 2 log An(w, En) = gn — Gn — €| On] = 1 


for any e€>0. From the Taylor expansions 
log 


(188) 


=— 6, — 6,) — 


and 


(189) ya On) = (0 ) >> 


(¢=r+1,--- 
we obtain 


1/2 


Pes 


(191) (6, — 0.) = + niin(En) (i= 1,--- 


and 


k 


where 


= (i,j=r+1,--- 
and for any positive vy we have 
lim P[ | | = lim P[| nin(En)| < >| 
= lim P| <»| = 1. 


j 
| 
| 
| 
| 
| 
| 
| 
r 
| 
Hence | 
| 
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r+1 r+1 


r+1 


lim P[| pijmin(En) | < = 1 

for any positive v. If at least one of the integers / and m is greater than r, 
we have 


j=r+1 i=—r+1 


Hence 


m=1 j=r+1 i=r+1 l=<1 m=1 


m=1 lel j=r+1 i=r+1 


The coefficient 


k k 
A tm(On) = Cim(On) — Do Do 


r+1 r+1 
can be written as the following ratio: 


Ck r+1(8n) Cen (On) 


It is known that if A’ is the adjoint of any determinant A, and M and M’ 


: [November 
k 7 k k 
r+1 r+1 r+1 r+1 
r+1 
= LD + pijmin(En) — 0,)(62 — 07), 
j=r+1 é<r+1 m=<1 
where 
k k k k k k 
96 | 
| | 
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are corresponding m-rowed minors of A and A’ respectively, then M’ is equal 
to the product of A" by the algebraic complement of M. 

Let A be the k-rowed determinant |¢;;(6,)| (¢, j=1,---, %) and let M 
be the (k—r-+1)-rowed minors 


Tkm(On) CK r+1(On) 
Then we have 
Cim(0n) Cin (On) 


mr 6, 6, 6, 
(195) +1( ) Cr+i ) ) = 


where M denotes the algebraic complement of M. Let M, be the (k —r)-rowed 
minor | ¢;;(6)| 7=r+1, - - - , &). Then we obtain 
(196) Mi = = 


where M, denotes the algebraic complement of Mi. 
From (193), (195) and (196) we obtain 


= M/M, = E1m(0,) (l,m =1,---, r), 
where || Zim(6,)|| (2, m=1, -- +, 7). Hence 
q=1 p=1 


where for any positive v 
lim P[| npen(Ex)| < On] = lim P[| px(En) | < On] = 1. 


From (187) it follows that for any positive 


lim P{- 2 log An(w, S > > — — 
(197) 


+ Nven(En) $-2 log An(w, E,) + = 1. 


| 

| 

| 

| 
| 

n= 
| 

| 
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Since 02= 6 (p=1, - - - , r) from (197) we obtain because of the boundedness: 
of the sequence {n/2(8,—0,) } 


lim P[— 2 log Xa(w, En) — € S D> — 60) (65 — 


S — 2 log An(w, En) + €| = 1. 


The above equation remains valid if we substitute 6, for 6,, that is 


lim P[— 2 log Xa(w, Ex) — S md. Dd, — 66) (65 — 


(198) S — 2 log Z,) + e| 6,] = 1. 


Let W,* be the critical region defined in (115). Since (173) holds for any 
sequence {6,} for which lim m/26(0,, w) = ©, we obtain from (198) 


(199) lim { P(W3| 6) — P[Za(w)|@]} = 0 


uniformly in 6. 
Now we consider the general case where w is given by r equations 


= --- = = 0 


such that Assumption VI is satisfied. As we have seen in §12, the whole the- 
ory remains valid if we replace the parameters @', - - - , 6* by the new pa- 


rameters 


where the functions £'1(6@),---, £*(@) satisfy Assumption VI. Hence from 
(199) it follows that 


(200) lim { P(W;| 6) — P[La(w)|6]} = 0 


uniformly in 6, where W,* denotes the region defined in (161). 
From (200) and Theorem VII we obtain the following theorem. 


THEOREM VIII. Let S.(0) be the surface defined in (162) and n(@) be the 
weight function defined in (165). If Assumption VI holds, then for testing the 
hypothesis = -- = &(0)=0 the likelihood ratio test 

(a) has asymptotically best average power with respect to the surfaces S.(@) 
and weight function n(@); 

(b) has asymptotically best constant power on the surfaces S.(0); 

(c) és an asymptotically most stringent test. 


14. Large sample distribution of the likelihood ratio. S. S. Wilks(*) has 
derived the large sample distribution of the likelihood ratio \,(w, Z,) if w is 


(*) S.S. Wilks, Distribution of the likelihood ratio in large samples, Ann. Math. Statist. 1938. 


a= eo 
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a linear subspace of the parameter space and if the hypothesis to be tested is 
true. Here we derive the large sample distribution of },(w, Z,) for any set w 
satisfying Assumption VI in both cases, when the hypothesis to be tested is 
true, and when it is not true. 

Let m, - - - , u, be r independently and normally distributed variates with 
unit variances. Denote the expected value of u, by uy. The distribution of 
the statistic 


+m 


is known(’). The only parameter involved in this distribution is \?=?+ - - - 
+2. Let us denote the cumulative distribution of U? by F,(\?, #), that is, 


(201) PU OW +m). 


Obviously F,(0, #) is the x?-distribution with r degrees of freedom. 

Let 1, - - - , v, be r variates which have a joint normal distribution. Denote 
by uw, the mean value of v, and by o,, the covariance between v, and 2,. 
Consider the statistic 


(202) V2 = > > Ape? 


q=1 p=1 
where ||Ap¢l| =||op¢||-". It is easy to verify that the distribution of V? is given 
by 
(203) P(V? < #) = 2), 
where 
(204) os y 


We will now derive the limit distribution of the expression on the left- 
hand side of (161), that is of the statistic 


(205) On = (bn). 


q=1 p=l 


The joint distribution of the variates 
—£*(8) ] converges with n— © uniformly towards the cumulative normal dis- 
tribution with zero means and covariance matrix ||o%,(6)|| =|| 24,(6)||-1. Since 
6, is a uniformly consistent estimate of @ and since ¢;;(@) is a uniformly con- 
tinuous function of , the statistic 


(206) On = Dd 


(7) See for instance P. C. Tang, The power function of the analysts of variance tests, Statistical 
Research Memoirs vol. 2 (1938). 


| 
| 
| 
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has the same limit distribution as Q,, that is, 


(207) lim {P[0, < — P[Q, <#|e]} =0 


uniformly in @ and ¢. 
It is easy to see that 
(208) lim { P[0, < t| 6] — F,[d,(0), t] = 0 


uniformly in @ and ¢, where 


Hence, because of (207) we have 


(210) {P[Q, < 0] — =0 
uniformly in @ and ¢. 

Let {@,} (n=1, 2,- +--+, ad inf.) be a sequence of parameter points for 
which n/*5(6,, w) is bounded. Then we obtain from (198) 


(211) lim P[— 2 log Aa(w, En) — S On S — 2 log An(w, + €| = 1 


for any positive ¢. From (210) and (211) it follows that 


(212) lim { P[— 2 log Xa(w, En) < t| — F,[da(@,), #]} = 0 
uniformly in ¢. Since (173) holds for any sequence {6,} for which lim n/?5(0,, w) 
= + ©, we obtain from (212) 


(213) lim { P[— 2 log Xx(w, En) < 6] — #]} = 0 


uniformly in @ and ¢t. Hence we have proved the following theorem. 


THEOREM IX. Let F,(d?2, t) be the distribution function defined in (201) 
and let X,(#, E,) be the likelihood ratio statistic for testing the hypothesis 
£1(0)= -- =£*(0)=0. Let furthermore d2(0) be the expression defined in (209). 
Then, if Assumption VI holds, we have 


lim { P[— 2 log Xa(w, < t| 0] — t]} = 0 
uniformly in t and 0. If the hypothesis to be tested is true, that is if 0 is a point 
of w, 2(0)=0 and therefore the limit distribution of —2 log Xn(w, E,) ts the 
x?-distribution with r degrees of freedom. 
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15. Summary. Let f(x!,---, 0',---+, 0*) be the joint probability 
density function of the variates x!, - - - , x" involving k unknown parameters 
6',---, 0. Any set of values 0', - --, 6* can be represented by a point 0 
of the k-dimensional Cartesian space with the coordinates 6', - - - , 0*. Let w 
be the subset of the parameter space defined by the equations 


(6) = = --- = = 0 (r 


that is, w is the set of all points @ for which the above equations are fulfilled. 
Denote by H, the hypothesis that the true parameter point @ is an element 
of w. In this paper the question of an appropriate test of the hypothesis H, 
is discussed when the number of observations is large. 

The following notations have been introduced. The point 6, denotes the 
point with the coordinates 6}, ---, 6: where 6! is the maximum likelihood 
estimate of 6‘ based on m independent observations on x’, - - - , x™. The ex- 
pected value of log f(x!, ---,x™, 61, - , 0*)00400/ is denoted by c;;(@) 
(p, g=1, - +--+, 7) is denoted by o},(6) and 


The region W,* denotes the critical region defined by the inequality 


de 


q=1 p=l 


where is the number of independent observations on x’, - - - , x™, and the 
constant d, is chosen so that the least upper bound of the probability that 
the sample point falls within W,*, calculated under the restriction that the 
true parameter point lies in w, is equal to a given positive a <1. 

Let X, be the likelihood ratio statistic for testing H,, and let L,, be the criti- 
cal region defined by the inequality 


An Xn, 


where the constant X, is chosen so that the least upper bound of the probabil- 
ity that the sample point falls within L,, calculated under the restriction that 
the true parameter point lies in w, is equal to a. 

Under certain assumptions on f(x!, - +--+, x™, 6’, ---, 6*) and the func- 
tions £'(@), - - - , &°(@) the following results have been obtained: 

I. For testing the hypothesis H, the critical regions W,* and L, both: 
(1) have asymptotically best average power over a family of surfaces defined 
in (162); (2) have asymptotically best constant power along the surfaces de- 
fined in (162); (3) are asymptotically most stringent tests. The exact defini- 
tions of these notions are given in Definitions VIII, X and XII, respectively. 

II. The statistics —2 log and have the 


| 
| 
| 
| 
| 
| 
if 
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same limit distribution. The limit distribution of —2 log X, is the x?-distribu- 
tion with r degrees of freedom if the hypothesis to be tested is true. If the 
true parameter point 6, is not an element of w, the distribution of —2 log A, 
approaches the distribution of a sum of non-central squares 


2 2 2 
U Ur, 
where the variates m1, - - -, “, are independently and normally distributed 
with unit variances and 
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ON SOME TRIGONOMETRIC SUMMABILITY METHODS 
AND. GIBBS’ PHENOMENON 


BY 
OTTO SZASZ 


1. Introduction. We consider Fourier sine series 


(1.1) 6, sin v8, 6, sin v6 = s,(6). 


Suppose that f(+0) exists and is greater than 0, that is f(@) has a simple 
jump 2f(+0) at @=0. We say that the series (1.1) presents Gibbs’ phenome- 
non, if 


(1.2) lim sup s,(8,) > f(+ 0), as 6, | 0. 
More generally, if only for some k20 


fe = 7 > 0 

exists, and lim sup s,(0,) >j/2, we say that the series (1.1) presents a gener- 
alized Gibbs’ phenomenon at @=0; j is the generalized jump of f(@) at @=0. 
Our aim is to find general conditions for f(@) jor its Fourier coefficients, which 
imply a Gibbs’ phenomenon. It is known that the jump 7 is closely connected 
with the asymptotic behavior of the sequence {nb,}; on the other hand 

sin v6, 


= y vb, 
1 
is a linear transform of {nb,}, with the triangular matrix a,,=v~! sin Op, 
y=1, 2, ---+,m. In aprevious paper [3](*) we have discussed the relationship 


of the transform 
sin v6, 


T,(6,) = T, 


to the Cesaro means of the sequence {r,}. As an application we have proved 
the presence of a Gibbs’ phenomenon under general assumptions and we gave 
new formulae for the determination of the jump 2f(+0). More general re- 
sults are given in the present paper; the knowledge of (3) is not assumed. 

We consider trigonometric linear transforms, related to T,. We discuss in 
particular the transforms 


Presented to the Society, April 24, 1943; received by the editors April 24, 1943. 
() Numbers in brackets refer to the literature cited at the end of this paper. 
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1 n 
B, = { + H, = T.(0,), 


1 
S, = T,(é)dt. 


In §5 we prove a theorem on (C, 2) summability of the sequence {nbn} for 


Fourier series. 
We shall make repeated use of the well known theorem: 


THEOREM A. The convergence of a sequence {r,} implies the convergence of 
the transform T,=)_*dnT,, if and only if 
lim = a,exists for v=1,2,3,---, 


> = O(1), as no, 


veel 
lim >> Gn, = o exists. 
We then have lim T, =¢ lim @,(7,— lim 7,)(*). 


2. The average (1/2){ 7.(8,)-+ Ta(¢n) } = Bn. On putting p,=1 in Theo- 
rem 2 of our paper [3]-we get easily the following theorem. 


THEOREM 1. Suppose that 
1 n 
(2.1) >0, 6,20, 0, =O(1), nb, = O(1), 
nm 1 


and thata(a) = J¢(t-'sin )dt has the same value for all limit points a of the sequence 
{n0,}, and the same value for all limit points B of the sequence {nda}; then 


(2.2) B, = (1/2)(o(a) + o(8)). 


For the special case ¢, =2/n see Rogosinski [1, 2]. 

We next assume (C, 2) lim Let =>_*7,, tr, then our as- 
sumption is: We write A°r,=Ta, A*t,=A(Ar,); 
then 


B,= 


sin + sin 2 sin + sin nd, 
2 +7, . 


n 


n—2 2y sin v0, + sin vd, + 2 { sin (n — 1)0, + sin (m — 1)¢, 


Tn—1 
n—1 


n 


(?) The statement at the beginning of §2 in [3] should be corrected accordingly; it does not 
affect the-rest of the paper. 
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It follows from Theorem A that, if in addition to the assumptions of Theorem 1, 


sin (w — 1)0, + sin (nm — 1)¢, 


n—1 


sin v0, + sin vd, 


1 v 


+ n| sin nd, + sin ng, | = O(1), 
then again (2.2) holds. 
We now assume 2(¢,—9,) = (2u—1)7+56,, 5,=O(1/m), w an integer 21. 
Then 


1 
sin 6, + sin nd, = 2 sin = (0, + dn) cos = (@n — on) = o(-); 
n 


also 


-1 On — Da 

cos (0, — da) = Cos (0, — dn) COs $ 
2 2 

6. Pa 


Furthermore, for 


sin vO 
A? ) = A? f cos vidt = R f A?*z"dt, s = cit, 
v 0 0 


sin v0 
A? = rf 2’(1 — z)*dt, 
0 


v 


< f | 1-—z |2de = f | eitl2 citl2 
v 0 0 


t 63 
= af sint— ar< f = —- 
6 2 0 3 


Thus in view of (2.1) 
a3 2 sin vO, + sin vd, 


lA 


1 v 


(2.4) 


< (+62) = 00). 


We have thus proved the following theorem. 


THEOREM 2. Suppose that (C, 2) limt,=7, 0,20, 20,=O(1), 
nb, =n0,+(24—1)4+O(1/n), u an integer greater than or equal to 1, and that 
o(a) = ft sin idt has the same value for all limit points a of the sequence 
{n6,}. Then lim B, =(r/2) {o(a) +o(a+(2u—1)x) }. 


3. The average (1/n) >-*.,7.(0.) =H. We have 


= 
|| 
sin — (0, — ¢,) sin ———— = —). 
2 2 n 
Hence 
and 
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(3.1) nH, = >, v1, sin = sin = Curr, 


pel vel 
where 


(3.2) Cw = sin v@,, 1, 2,°°° » 


Thus 2H, + Cant. We first prove the foliowing theorem. 
THEOREM 3. Jf (C, 1) lim 7, =7, and 


1 n 
(3.3) — >| #, —a|—0, 
| 


then 


sint 
(3.4) f dt. 
0 


(3.3) is strong summability (C, 1) of the sequence {n0,} to a; it implies 
that @,—0, hence lim,... for each v. Furthermore 


— + Can | 


n—1 


= >>| d sin — 


v 


sin (+ sin 


1 v 


< 5 > (sin — sin + 1)0)| 


1 n 
+ sin + +1 
1 y+1 
n n—l 6 2 1 n—l 
2 2 1 


1 


<3>>#,+1=O(n), as 
1 


hence H, =O(1). Thus, in order that H, has a limit whenever n-'1,) tends to 
a limit 7, the additional condition is that 


1s 1 
lim — >> C,, = lim— sin 0, = 


exists. We then have H,—er (cf. Theorem A). Now 


n £ n n n 
n 
n 
| 
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% sin (« + 1/2)é 
Furthermore 
® sin (x + 1/2)é ae ® sin (x + 1/2) 
J 2 sin (¢/2) me 


and, from the mean value theorem, 


2sin (¢/2) ¢ 2t sin (t/2) 12. 


(3.5) 0 


Hence 


1 1 1 1 


(«+1/2)6« sin 2 


0 
(«+1/2)0« sin ¢ 6, 2 
= at f — dt — — + 
t a t 2 


where | 5,| <1/24. Thus, for 0<0,<1, 
-f —at < | (« + 1/2)0, — a] + 0, <| — a| + 
0 


and from (3.3) 
1 n a ; t 1 n 2 n 
n 1 0 t n 1 n 1 
which proves Theorem 3. For the case of (C, 2) summability we prove the 
following theorem. 


THEOREM 4: Jf (C, 2) lim 7, =7, 


| 
1943] 487 
| | 
and | 
and 


488 OTTO SZASZ 


(3.6) Dd »?| — = O(n), as 


1 
and if (3.3) holds, then (3.4) holds. 
Evidently (3.3) implies = O(n) ; furthermore 


(3.7) v?(0, 6,41) = > (2 1)6, = O(n) 27041, 


1 
hence from (3.6) 
(3.8) no, = O(1) as 
We have from (3.1) 


n—2 


where ”C,,=sin 20, =O(1), and 
n(Cu.n—1 — 2Can) = O(1) + (n/(m — 1)) {sin — 1)0,_1 + sin (n — 1)0,} =O (1). 
Furthermore, using (2.4) and (3.8), 


> »*| AC, | 


1 
n—2 1 n 2 n 1 n 
= > »*|— sin 0, -—— © sin (» + + —— sin + 
1 Vi omy v + 1 v + 2 
n—2 1 2 
> »?|—(sin + sin 0,41) — sin (v + 1)0,41 
v yv+i1 


1 


/sinvé, 2sin(v+1)@, sin + 2)6, | 
2 
2) v yv+i1 yv+2 ) 
n—1 
< O(n) + >| sin v6, + sin — 2 sin (v + 1)6,4:| 


1 


+ > | sin | 
1 amv+2 


< O(n) + DY + + v| — ) + 


1 


= om) +0( rts) = O(n). 


1 
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Now (3.4) follows as in Theorem 3. This proves Theorem 4. In this connec- 
tion the following lemma is of interest: 


Lemma. for some c>0 
(3.9) O S S (1 + c/n)O,, 
and 


(3.10) > vb, = O(n), 
1 


then (3.6) and (3.8) hold. 


We write > =>,’ ’, where >>’ contains all terms with 
6, £0.41, and >,”’ the rest. Now, using (3.9) and (3.10), 


0, = O(n), 
and from (3.7) 
hence >.’ =O(m) and (3.6) is proved. This and (3.10) yield (3.8). In particu- 
lar in Theorem 4 assumption (3.6) can be replaced by 0, | . 


4. The integral mean (1/0,)/3"(>_%7,7— sin vt)dt. To a given sequence {7,} 
we consider the transform 


(4.1) S, = dt = 


so that now 


= On 0; 
1 


(4.2) Gn» = (1 — cos y=1,2,---,% 


Evidently 0<a,,—0 as n— © ; hence a necessary and sufficient condition 
that for every convergent sequence 7,—7 the sequence S, has a limit o7 is 
that lim >-74,,=¢ exists (from Theorem A). Now 


1 — cos a 


sin (nm + 1/2)é 
2 sin (#/2) 


— #) 


| 
| 
| 
asn— ©, 

on 
4 0 
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n 6, On 
aw = —-— +6, (0, — 4) 
1 + 0 


sin (n a 


+6, x ~ o{ -\ sin (mn + 1/2)édt, 


and using (3.5) 


2 sin (¢/2) 


On 1 1 t 
On --} sin (2” + 


< 6, @.- < 


Hence 


(n+1/2)6n t sin ¢ 
= 1 0, 6, 
o(1) + ( — di 


(n+1/2) On sin 2 
= o(1 1 - 1/2)6,}. 
If 8S @ is a limit point of the sequence {n,}, then for a subsequence of 
integers 


® sint 1 —coss 


Now g’(8) =(1—cos 8)/8?20, hence g(8) T , as 6 increases, to g() =/2. We 
have thus proved: 


THEOREM 5. In order that for every convergent sequence t,->7 the trans- 
form S,.=>_i1,(1—cos v0,)/v%6, has a limit, it is necessary and sufficient that 
BS ~, exists. We then have S,—rg(8), where g(8) = sin é)dt 
—(1—cos Sx/2. 


We now assume only or We have "(Gur 
—Gn+1)Ts +4nnTn ; and the additional condition for the existence of lim S, 
is by Theorem A: 

=! |1— cos, 1— cos (vy + 1)0, 1 — cos n6, 


or 


(4.3) 6. y (» + 1) (1 — cos — » (1 — cos (v + 1)6,)| = O(1). 
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n n 1 n 
1 
hence (4.3) to cos v0,,—cos (v+1)6, | =O(1), as n— ©, or 
|sin (v+1/2)6, | = 
But this condition is equivalent to 70, =O(1) (cf. [3, §2]). Thus we have 
the following: 


THEOREM 6. The transform S,=)>_%r,(1—cos v0,)/v*6,) has a limit for every 
sequence {t,} which is summable (C, 1) to 7, if and only if lim n0,=B< @ ex- 
ists. We then have S,—rg(8). 


We finally assume (C, 2) lim r, =7; using the formula 


aT, = a, + 2an) + 


we now get for the existence of lim S, the additional conditions 

— cos v6, 1—cos(v+1)@, 1-—cos(v»+ 2)0, 
4.4) 6, —2 = O(1), 


2/1—cos(m— 1), 1— 1 — cos 


(4.5) 6, = O(1). 


(m — 1)? n? 


In particular sin? 20,/2=O(6,) =0(1), hence n0,—»8 = X an integer. 
We assume A>0; on putting 20, =2Ar+2e,, €,—0, we must have 


sin? (Aw + ¢€,) = sin? = O((Aw + €,)/n) = O(1/n), 


or €,=O(1/n"?). To satisfy (4.5) the additional condition is sin? (n—1)6, 
=0(6,), or (sin 28, cos 0, —cos 6, sin 6,)?=O(1/n), which reduces to our pre- 
vious condition. We finally show that now (4.4) is also satisfied. We have 


1 — cos vO 
= f dt f cos = f (@ — 4) cos védt, 
vy? 0 0 
hence 


A? (@ — cos vidt = Rf (@ — é)2"(1 — z)*dt, = 


and 


1 — cos 
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Using this inequality, the left side of (4.4) is less than >-*-71763 <n*03 =O(1). 
This proves the following theorem. 

THEOREM 7. The transform S, has a limit or (with o #0) for every sequence 
{r.} which is summable (C, 2) to r if and only if =2(dr+€n), =O(n-"?), 
\ an integer greater than 0. We then have S,—7f9*t" sin t dt=1a(2Am). 


5. (C, 2) summability of { nb,} for sine series. We shall prove the following 
theorem. 


THEOREM 8. If f(@)~>_*b, sin v0, and for some j 


6 8 
6.) f - = 28 040, 


then 
(5.2) (C, 2) lim nb, = j/. 


We have, for r,=nb,, 
= (n — v + 1)rd, = = (n—v+ 1) sin») 


and sin —(d/dt) —v +1) cos vt. On putting 


1 sin (m + 
cos = — + sin = 


we have 


1 
WD sin (» + 1/2)é 


2 2sin (¢/2) 

2 sin? ({/2) 


{ sin? (w + 1)(¢/2) — sin? (¢/2)} 


1 
2 
hence 


> (n—v-+1)v sin vt 
(5.3) 


sin (n+ 1)(t/2) 


t 
sin® (/2) {cos — sin (n+1) cos (n+1) sin} 


2 2 


| ‘ 
Thus 
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sin (n+1)(¢/2) {sin 
sin? (¢/2) 


t t t 
nm —— ncos (n+1) — sin — pdt; 


2 1 
(6.4) f 


furthermore 


(n — » + 1)v sin 
(n—1)/2 


=4 (= ) = In, 
say, where 2hn/n*—1 as n— ©. Our aim is to prove 
(5.6) tal — 0. 
From (5.3), (5.4) and (5.5) 
sin (m + 1)(t/2) 


where 
K,(é) = sin (nt/2) — n cos ((m + 1)#/2) sin (¢/2) 

= n sin (¢/2)(1 — cos ((m + 1)#/2)) — (m sin (¢/2) — sin (nt/2)). 
Now from (3.5) 


(5.7) 


sin* (t/ sin t /\sin? (¢/2) sin (¢/2) 


t 
2 sin (¢/2) 


and 
| Kn(t)| < msin (¢/2) +| sin n(¢/2) |. 


Hence 


| sin (nt/2) | 
< =f | dt = O(n) = o(hn); 


thus (5.6) will follow if we prove 


| 

| 

| 
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sin 1)t/2) K.(t)dt = o(n*). 


fl) — 4/2} 


In view of (5.7) this will follow from 


sin ((m 4- 1)t#/2) t 


(5.8) f { f(t) — j/2} r sin <(1 — cos (n + 1) =) dt = o(n), 


and 


(5.9) f — 4/2} 


sin — — sin nm — ) dt = o{n’). 
# 2 2 


Again, using (3.5), 
~ sin—)(1 — cos (n + 1) 


< | — at = O(1), 


thus (5.8) is equivalent to 


os (nm + 1) =) = o(n), 


We write I,=/3"+J%,=Li+L2, say, where c is a constant, arbitrarily 
large. On putting /? {f(t) —j/2}dt= F(0), and using integration by parts, we 
get from (5.1) 

F(t) sin (n + 1)(#/2) — 2-' sin (n + 
t t 
ele 1)#/2) — 1)t 
» f ry ((m + cos + 1) 


1 = 


+ 2f sin ((m + 1)#/2) 2-! sin (n + 1)é 


= + f sin + 14/4) sin ((m + 1)4/4) | at 


+ o(1) f sin? ((m + 1)4/4)dt = o(n), 


494 [November 
or 


1943] GIBBS’ PHENOMENON 495 


as Furthermore |La| —j/2|2 sin? we use 

here Fejér’s inequality, sin? x <(2x/(1+))? for x>0. Then 

(n+ 1)? ¢* dt 

_ 

A+ 1)4/4))? 

+ 1)? f 
en 4(1 + ((n + 1)t/4))* 


2 
where = —j/2| dt. But from (5.1), ¢(8) <6, y an absolute constant, 
hence 


| Le} < 


c/n 


(nm + 1)* tdt 
8 (1 + ((m + 1)4/4))* 

(m + 1)? dt 
2 A+ (n+ 1/4) 1+6/4 


and lim sup,.. Thus lim supn.. I,,| S8y/(4+c); but 
being arbitrary, now (5.8) follows. 

Similarly we decompose the left side of (5.9) into o"+Soin= Mit Mesay, 
where 


| L2| < O(1) + 


+ O(1), 


<O(1) + 


M,= {f() - (om ne — cos (m + 2) 


1 t t 
+ (cos (2n + 1) > — cos dt 
c/n 
c/n c/n 


+3 f (OF (de. 
0 


The mean value theorem yields 


4n‘? 12n4f 24n*t 96 


for i0, hence, using (5.1), we find that Mi=o(n*), as n>. 


| 
| 
| 
| 
| 
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To estimate M; we use the formula 
n sin x — sin mx = sin x), {1 — cos (nm — 2» + 1)x} > 0, 0<x<z, 
v=1 


which follows from 


inz n n 
sin 
= = = cos (m — + 1)z. 
1 1 


sin x — 


Now 
t 
| <f sin — 41 cos (n — 2v + 1) 
c/n 1 


t 
< {1 cos (n 2» +1) bat =D D, 


2vSn 2vSn 

say, and, as in the estimate of ZL: 

(n — 2» + 1)? 2y(m — 2v + 1) 
2(1 + (m — 2v + 1)x/4)? 1+ ¢/4 
8y(n — 2» + 1) 

4+¢ 
Hence | +4yn?/(4+<¢), and lim supra... | /n?<4y/(4+0); be- 
ing arbitrarily large, we finally get (5.9), and Theorem 8 is proved. 

6. The jump of f(@) and Gibbs’ phenomenon. The foregoing results enable 


us to give new formulae for the jump of f(@) and to prove a Gibbs phenome- 
non. 


THEOREM 9. Under the assumptions (5.1), j is determined by any one of the 
formulae 


D, < 


< + 


(6.1) (C, 2) lim nb, = as 


1 
1 n 0 


(6.3) 


(6.4) 


where s,(t)=)>.%, sin vt. 


sin t 
t 
i: 1 * sint 
tim — = —5 f — dt, 
amo 1 v 0 t 
1  f**sint 
| tim f = —j f —— dt, 
0 0 t 
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(6.1) is the statement of Theorem 8; (6.2) foliows from Theorem 2 for 
0, =0, 6,=7/n, T2=nb,, using (6.1). Similarly (6.3) follows from Theorem 4 
for T,=nb,, 0,=2/n, and (6.4) follows from Theorem 7 for 0, =22/n. 

To prove a Gibbs’ phenomenon under the same assumptions, put, in Theo- 
rem 2, 0,=0, then 


1 
Jo 


hence 
lim sup s,(0,) 2 7/2 X 1.08949---, 


Similarly from Theorem 4 for a=z 
—> s,(6, sin édt, 
| 0 


hence 
1 r 
lim sup 5,(6,) = —j f sin 


Theorem 7 also proves the presence of a Gibbs’ phenomenon, however with 


a smaller constant. 
We may also combine Theorem 8 with Theorem 3 of [3], putting there 
x=2, Summarizing, we have the following theorem. 


THEOREM 10. Under the assumptions (5.1) 


lim sup Lb, sin vO, 2’ 


2 
sin tdt = 1.08949--- . 
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ON REFLEXIVE NORMS FOR THE DIRECT 
PRODUCT OF BANACH SPACES 


BY 
ROBERT SCHATTEN 


Introduction. In a previous paper [7]('), for two Banach spaces Ej, Es, 
the Banach spaces Ei:@E:, E{ @E/, Ei' @Ei’ [7, p. 205] are constructed. 
If the norm N [7, Definition 3.1] is defined on Ei:@Es, then the associate 
norm N’ [7, Definition 3.2 and Lemma 3.1] is defined on E{ @E/. Similarly 
N”’ denotes the norm on Ej’ @ Ez’. 

Among the unsolved problems (mentioned in [7, §6]), are listed the fol- 
lowing two: 

A. What are the exact conditions imposed upon a crossnorm [7, Defini- 
tion 3.3] under which (Z,:@E:)’=E{ @Ei holds? 

B. Is the associate with every crossnorm also a crossnorm, or do there 
exist crossnorms whose associates are not crossnorms? 

In the present paper we present a “partial” answer to problem A (which 
we denote by A*), and a “partial” answer to problem B (which we denote 
by B*). 

A*. A uniformly convex crossnorm WN sets up the relation (Z,@£;)’ 
=E{ @Ej if, and only if, N’’=N. 

B*. For reflexive Banach spaces (that is, such that E{/’=£,, Ey’ =E,) 
the associate with every crossnorm is also a crossnorm. 

In this paper we also show that the values of a crossnorm for all expres- 
sions of rank not greater than 2 do not necessarily determine the crossnorm. 

The following should be mentioned in immediate connection with prob- 
lem A: 

It is evident that for norms for which (Z;@E:)’= Ej @Ej holds, N’’=N. 
Since in general (for any norm JN) all we can state is (Ei@E:)’>E{ @Ei 
[7, p. 205], we have no basis for assuming that N’’ represents the norm in 
or -N’’=N for expressions in E:) Ei’) [7, Defini- 
tion 1.3]. Therefore, N’’ SN [7, Lemma 3.2] is the best that can be stated in 
the general case. 

In the present paper we present some results on reflexive norms, that is, 
such that V’’=N. 

Presented to the Society, April 24, 1943, under the title “On certain properties of cross- 
norms.” Received by the editors March 11, 1943, and, in revised form, June 3, 1943. 

(*) Numerals in brackets refer to the bibliography at the end of this paper. Throughout 
this paper we shall use the results and notation of [7], with a slight modification. Since the 
printer does not find it easy to handle double and triple bars over expressions by machines, we 
shall write Z’ instead of Z, E’’ instead of E, E’” instead of E. Similarly N’ shall take the place 
of N, and that of Thus (N’’)’ shall stand for (NW). 
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In §1 we prove that norms that are reflexive, minimal, or have an asso- 
ciate property are identical. In §2 we show that for reflexive Banach spaces 
the associate with every crossnorm is also a crossnorm, that is, there exists 
a least crossnorm. In §3 we present a method for construction of reflexive 
crossnorms and prove certain inequalities. An application of the results of 
this section to Hilbert spaces permits us to construct semi-self-associate 
crossnorms, that is, for every natural number & a crossnorm 5S; is constructed 
such that S,#S and S,=(S;)’=S for all expressions of rank not greater 
than k, where S denotes the self-associate crossnorm for Hilbert spaces con- 
structed by F. J. Murray and J. v. Neumann [5]. This last result also proves 
that a crossnorm is not determined by the values which it assumes for all ex- 
pressions of rank not greater than 2. Finally, in §4 we show that a uniformly 
convex crossnorm sets up the relation (ZEi@£,)’=E;{ @Ej if, and only if, 

1. In this section we shall assume that £;, EZ; are Banach spaces, with no 
special restrictions. 

We introduce the following additional notation: 

a. If anorm N is defined on E2), then N”’ is defined on Ez’) 
By (N’’) we shall understand N’’ considered only on E:). 
Similarly if N is defined on is defined on A(Ei", Ei’’) 
Ef), and (N’’’) denotes N’”’ considered only on , 

b. The set &(Z:, E2) in which there is defined a norm N, we shall denote 
by Thus Ay (Ei, Es) shall denote the set E/) in which 
there is defined the norm N’ associate with N. 

c. The symbol : fE%*(E;, Es) : sup | F(f)| /N(f) shall denote the least 
upper bound for all numbers lFA)|/N A), obtained when f varies over 
Es). 


Lemma 1.1. Jf N is a norm in U(Ei, Es), then N'=(N"')'=N'" for F in 
Ei’). 


Proof. Let F € Then 
= : f U*(Es, Ex): sup | F(f)| /N(A) 
< M*(Ei’, Ex’) sup | F(f) (f) =n’). 
On the other hand, N’’’(F) $ N’(F) [7, Lemma 3.2]. This completes the proof. 


DEFINITION 1.1. A norm N will be termed minimal if, for every norm N® 
for which N’=(N°)’, we have N°2=N. 


DEFINITION 1.2. A norm N will be termed reflexive if (N’’)=N. 
DEFINITION 1.3. A norm N in U(Ei, Es) will be said to have an “associate 


| 

| 

| 

| 
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property” if, for a certain norm N° in , CUw(Er’, Ez’). 


THEOREM 1.1. For a norm N, the following statements are equivalent: 

(a) N is minimal, 

(b) N is refiexive, 

(c) N has an associate property. 

Proof. We shall prove (a)—>(b)—>(c)—>(a). Let N be minimal. By Lemma 
1.1, (N’’)’=N’ for E/) and any norm N in Thus 
(N’’) and N have the same associate. Since N is minimal by hypothesis, this 
means (N’’)2=N. On the other hand, for any norm N, (N’’) SN [7, Lemma 
3.2]. Thus (N’’)=N. We have proved (a)—>(b). 

Suppose JN is reflexive, that is, (V’’)=N. Then obviously NW has an asso- 
ciate property, because 


E2) Ww (Ei, E:) Un (Ex’, Ez’). 


Therefore (b)—>(c). 

Suppose finally that NV defined in M(Z,, Z2) has an associate property, that 
is, for a certain norm N° defined in A(E}, E/), E2) CUw-(Ei’, Ez’). 
Let thenorm Nin Ez) besuch that (N)’ = N’. Then, for FEA*(E; , 


we have 
(N%)'(F) = N’(F) = : f U*(Ey, E:) : sup | F(f)| /N(f) 
fE Ei’): sup | F(f)| 


= (N°)"(F) N*(F). 


Thus, throughout Ei), and therefore (N®)’’>(N*)’ 
throughout Ei’). Consequently, for f in 
E:) CU*(Ei’, Ef’). Thus N is minimal. Therefore (c)—>(a). This com- 
pletes the proof. 


1. If N and N® denote two reflexive norms, such that NS N® 
and N#¥N*, then N’=(N°)’ and N’#(N°)’. 

Proof. Obviously N’2(N°)’. If it were N’ =(N°)’, then N’’=(N°)”’ and 
(N’’)=((N®%’’). By hypothesis N and N° are reflexive. Thus Definition 1.2 
gives N=WN°. This contradicts our assumption. 


Coro.iary 2. If N and N° denote two norms in U(E;, Ex), of which N is re- 
flexive, and NS N° throughout E2), then NS((N°)’’). 


Proof. By assumption, NS N°. Therefore N’2(N®)’ and [7, 
Lemma 3.3]. Thus (N’’)S((N®)’’). N is reflexive by assumption. Therefore 
N=(N"), and NS((N°)’’). This completes the proof. 

Remark. Property (c) of Theorem 1.1 suggests the existence of an infinite | 
number of different reflexive crossnorms. We shall prove later that this is 


Zz 
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the case, and present a method for construction of reflexive crossnorms(’). 

2. Throughout the rest of this paper we shall assume E/’, Ez’. 
In this case E2)=A(Ei’, Ei’), and for any norm NWN in 
(N"’)=N", (N’"’)=N’"’. It should be noticed, however, that many results 
of the following sections are also valid for general Banach spaces. 

Let Ni(E:, E2), Ne(E:, E:) denote the least crossnorm whose associate 
is also a crossnorm and the greatest crossnorm in W%(Z;, Es), respectively; in 
the case where there is no fear of misunderstanding, we shall write simply Nz, 
Ne [7, Definitions 4.1, 4.2, Lemmas 4.1, 4.2 and Theorem 4.1]. 


LEmMMaA 2.1. E.)=N 1(Et’, Ez’) and E:)=Ne(Ei’, Bf’). 


Proof. This is a consequence of the definition of the least and greatest 
crossnorms [7, Definitions 4.1, 4.2]. 


LEMMA 2.2 For any crossnorm N in E:) for which Ex), we 
have Ed), Ei')=N1(-,, E:), 


Proof. Let N denote a crossnorm whose associate is also a crossnorm, that 
is, N2Nz [7, Theorem 4.1]. Since N’’SN [7, Lemma 3.2], N’’(f@¢) 
<N(f@¢)=|lF\| |lel] for fEZ:, PEE But N’’ is the associate of the cross- 
norm N’. Therefore N’’(f@¢g) 2||fl| ||¢l] [7, Lemma 4.3]. Thus N’’ is a cross- 
norm, or the associate with the crossnorm WN’ in Y(E;/, Ez) is a crossnorm. 
This gives N’>N1(Ei, Ei) [7, Theorem 4.1]. From N2Nz(E:, Ex) we con- 
clude N’2N.(Ei, Ez). Similarly from N’2Nz(E{, Ef) we conclude 
Ef’) =N1(E:, E:), by Lemma 2.1. This completes the proof. 


LEMMA 2.3. Nz is reflexive. 


Proof. Obviously N,2=Nz. Therefore (Nz)’’=N1z, by virtue of Lemma 
2.2. But (Nz)’’S Nz [7, Lemma 3.2]. Therefore (Nz) =Nz. 


LemMA 2.4. The associate with the greatest crossnorm is the least crossnorm, 
that ts, (Ne(~,, E.))'’=Ni(Ei Ed). 


Proof. Since No(E:, E:)2N1(E:, Ex), as 
follows from Lemma 2.2. But (Nz(Ei, E/))’ is a crossnorm [7, Theorem 4.1]. 
Thus (N.i(Ei, E/))’SNoe(Ei’, Ei')=Ne(E:, Ex) [7, Theorem 4.2], or 
Ni(Ei, E2))’, as follows from Lemma 2.3. 
Therefore (Ne(E:, E2))’=Ni(Ei, Ef). This completes the proof. 


CoROLLARY. N, and N¢’ are associate with each other. 
Proof. The associate with N¢’ is Nd” =N¢ =N1z, by Lemmas 2.4 and 1.1. 


(?) The question of existence of non-reflexive crossnorms is not settled in this paper. It 
should be noticed that for every non-reflexive norm, Ei @E; would be a proper subset of 
(Z,@£;)’. In particular it is not settled whether the greatest crossnorm is reflexive. It can be 
shown, however, that when £, E; are Hilbert spaces, then the greatest crossnorm is reflexive. 


| 
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The associate with Nz; is N/ =Nd¢’, by Lemma 2.4. 
Lemma 2.5. If, for a certain crossnorm N, N'2=N1, then N2N,_. 


Proof. Lemma 2.2 gives N’’2 therefore N2 N’’=N_. This completes 
the proof. 


THEOREM 2.1. Every crossnorm N2N1. 


Proof. Since N is a crossnorm, N<Ne [7, Lemma 4.2]. Therefore 
N’'=Nd =N1z, as follows from Lemma 2.4. Thus Lemma 2.5 gives N2 Nz. 


This completes the proof. 
THEOREM 2.2. The associate with every crossnorm is also a crossnorm. 
Proof. This is a consequence of Theorem 2.1 and [7, Theorem 4.1]. 
THEOREM 2.3. There exists a least crossnorm. 


Proof. It is obviously Nz, as follows from Theorem 2.1 and [7, Definition 
4.2 and Lemma 4.4]. 


COROLLARY. For every crossnorm N in 


@ for F € E;. 


i=1 


Proof. This is a consequence of Theorem 2.1 and [7, Theorem 4.1.1]. 


The question of existence of a norm not less than Nz which is not a cross- 
norm, and whose associate is a crossnorm, remains open. It is clear that if 
such a norm exists, it must be non-reflexive. 


3. DEFINITION 3.1. Let E1, E, denote two Banach spaces, and N a cross- 
norm in U(Ei, Ei). We define a sequence of functions {N,} for expressions 
in U(E, Ex) =U(Ei’ , Ez’) im the following way: 


n k n k 

t=1 j=1 j=1 

where sup, that is, the least upper bound, is taken for all sequences of k terms 

Fi, +++, Fein Ei and Ef. 


THEOREM 3.1. For every natural k, N; is a crossnorm in U(Ex, E2). 


Proof. The proof is similar to that of [7, Theorem 7.2]. 
From the definition, and Ni=N’. 


THEOREM 3.2. For every natural k, the crossnorm N, is reflexive. 


Proof. Suppose that for a certain expression )-i_,ff@¢?GU(E:, Ex), 
(if @¢?) — where is a certain positive number. 


[November 
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Then for a given expression in E/), there exists an expres- 
sion such that 


| Lhe 


«)- 


This means 
< m( 


for every sequence of & terms in Ef; oe, in Tak- 
ing originally for an expression of the form we get 
| | /N <Ni(Li-sf? — efor every se- 
quence of & terms Fy Fi in EY, ou in EZ. Therefore 
S @¢?) — €, as follows from Definition 3.1. The last 
inequality cannot hold. This compietes the proof. 


THEOREM 3.3. If N is a reflexive crossnorm, then (N,)'=N for all expres- 
sions of rank not greater than k. 


Proof. Obviously (N;)’ SN for all expressions of rank not greater than k 
as a consequence of Definition 3.1 for N;, and that of an associate with a given 
norm. On the other hand everywhere in £2). Therefore 
(Ni)’2N’’=N. This completes the proof. 


THEOREM 3.4. If N is a reflexive crossnorm, then N, is the least crossnorm 
whose associate equals N for all expressions of rank not greater than k. 


Proof. Theorem 3.3 gives (N;)’=WN for all expressions of rank not greater 
than &. On the other hand it is easy to see that if a crossnorm N° satishes the in- 
then there exists an expression such that 
Thus (N°%)’=N, for all expressions of rank-not greater 

than k, implies N°2 N;. This complétes the proof. 
‘ It is not difficult to see that the last theorem is a generalization of [7, 
Theorem 4.1]. 


| 
| 
| 
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In the case E; and E; are Hilbert spaces, we may assume W%(Fi, Es) 
=%(Ei, E/). In this case, N and N’ are defined in %(Hi, E2). For the case of 
Hilbert spaces, F. J. Murray and J. v. Neumann define in &(Zi, Ee) a self- 
associate crossnorm, which we shall denote by S [5]. 


THEOREM 3.5. For every natural p, S,=(S,)'=S for all expressions of 
rank not greater than p. 

Proof. Since S=S’, S=S'’. Thus S is reflexive. Theorem 3.3 gives 
(S,)’=S for all expressions of rank not greater than p. Now suppose 
that for a certain expression >-*.,f:@g; and a certain crossnorm NSS, 
es) fii). Since NSS, N’>S'=S. But NN’ [7, 
p. 213]. Therefore fi: An application of the last 
remark to the fact that S,<.S and (S,)’=S, for all expressions of rank not 
greater than p, gives S,=S for all expressions of rank not greater than p. 
Thus S,=(S,)’=S for all expressions of rank not greater than p. This com- 


pletes the proof. 

THEOREM 3.6. For every natural p, Sp%Sp+1- 

Proof. That S:*.S3(*) is shown in the following manner: Consider an ex- 
pression + -f2@g2: + -fs@¢s, where fi, fe, fs, and ¢1, ¢2, gs form ortho- 
normal sets. A calculation shows that 
S2(f: @ gic @ 

| (hi @ ++ he @ x2) (fr @ fe @ +-fs vs) 
S(hi x1°+-he ® x2) 
where sup, that is, the least upper bound, is taken for all pairs Ai, he, in Dr 


(the closed linear manifold determined by /, fe, fs) and x1, x2, in Ms (the closed 
linear manifold determined by ¢1, ¢2, gs). From this it is easily concluded that 


S2(fi@gi: + + $2"? and 
S3(fi @ +-f2 @ +-fs @ os) = @ +-fe @ +-fs @ vs) 


A similar reasoning can be applied to prove Sy#S,4: for any natural p. 


= su 


Coro.tiary. S,#S for p=1, 2, 3,---. 


Proof. Obviously S,SSp4i15S. But S,~S,41. Therefore S,#S. This com- 
pletes the proof(*). 


(*) This result and proof is due to F, J. Murray. 

(*) For a reflexive (or not) norm N, N, is reflexive. There exist, however, reflexive norms, 
for instance S, such that for no reflexive norm N is S= N,. Proof. For a reflexive N, Definition 
3.1 and Theorem 3.3 give ((Nz)’)x= Ne. If it were S= N, for a reflexive N, then (N;)’=S’=S 
and S,=S as a consequence of the preceding relation. We have shown above that this is not the 
case. This completes the proof. 
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Remark. From Theorems 3.5 and 3.6 it is evident that we have constructed 
three different reflexive crossnorms, which are equal for all expressions of 
rank not greater than p; namely Sp, (S,)’, S. 

S, and (S,)’ are reflexive, hence associate with each other. 

Thus for every natural » we have constructed reflexive “semi-self-asso- 
ciate” crossnorms S, and (S,)’, that is not self-associate, but equal to their 
associates for all expressions of rank not greater than p. 

Incidentally, this result also proves that the values of a crossnorm for all 
expressions of rank not greater than p (where p denotes any natural number) 
do not necessarily determine the crossnorm. 

4. In the introduction of this paper, it was pointed out that if a norm 
sets up the relation (£:@E2)’=E/ @Ez, then N’’=WN for expressions in 
W(E:, E:) CU(Ei’, Ez’), or N is reflexive. In the present section we consider 
a “converse” problem. 


Lemma 4.1. If N is a reflexive crossnorm in U(E;, E:), then E, QE, 
@Ez’. 


Proof. The linear set U*(Ei’, Ez’) in which there is defined the norm N’”’ 
is an extension of the linear set &*(#:, Hz) in which there is defined the 
norm N. Thus the closure of A*(Z/’, Ez’) is an extension of the closure of 
E:). This completes the proof. 


Lemma 4.2. If N is a reflexive crossnorm in Ex), then E{ forms 
a fundamental subset of (E:@E:2)', that is tf, for all F in E{ @Ei and a certain 
fo in Ei @E:, F(fo) =0, then fo =0. 


Proof. For FoGEi @Ei , we have N’(Fo) =sup | Fo(f)| /N(f) where sup is 
taken over the set of all f’s in Ei@E: [7, Lemma 3.4]. Similarly, for 
Fit @Ei’, =sup | F#(F)|/N’(F) where sup is taken over the 
set of all F’s in E{ @E/. N is reflexive by assumption. Therefore for 
Es, Lemma 4.1 gives N’’(fo) =sup | F(fo)| /N’(F), where sup is taken 
over the set of all F’s in E{ @E;. The second part of our assumption gives 
N’'(fo) =0. But N’’=N. Therefore N(fo) =0. Thus fo=0. This completes the 
proof. 


Lemna 4.3. If a reflexive crossnorm sets up the relation (E,@E:)'’=E, @E2, 
then (Ei: 


Proof. Suppose that (E;®E:)’’ = E,@ Ez, and F* is an element of (E:@E:)’ 
which does not belong to Ei @E/. From the construction of Ei @ Ej follows 
that the set is closed in (E,:@£,)’. Hence, there exists a linear functional 7 
on (£,@£:)’, such that 7(F*)=1 and ¥(F)=0, for F in E{ [1, p. 57]. 
Since (EZ, @E:)'’=E,@Ez, there exists an fy in E,@Es, corresponding to 7, 
such that F*(f,)=1 and F(f>)=0 for F in E{ @Ei. The last condition im- 
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plies fy=0 by Lemma 4.2. This contradicts F*(f))=1. This completes the 
proof. 


THEOREM 4.1. If a reflexive crossnorm N defined on U(Ei, Es) is uniformly 
convex [2], then (E:@E2)'=E{ 


Proof. By continuity it follows that N is uniformly convex in E,@E2, 
hence [3, 4, 6] (Z:@£:)’’=E,@E:. An application of Lemma 4.3 completes 
the proof. 
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QUASIGROUPS. I 


BY 
A. A. ALBERT 


1. Introduction. A theory of non-associative algebras has been developed(!) 
without any assumption of a substitute for the associative law, and the basic 
structure properties of such algebras have been shown to depend upon the 
possession of almost these same properties by related associative algebras. 

It seems natural then to attempt to obtain an analogous treatment of 
quasigroups. We shall present the results here. Most of the results in the 
literature on quasigroups do depend upon special associativity conditions(?) 
but no assumption of such conditions is necessary for our theorems. 

Every quasigroup @ may be associated with the group G, of nonsingular 
transformations generated by its multiplications. The isotopy of two quasi- 
groups may then be defined and, as in the case of algebras, two groups (that 
is associative quasigroups) are isotopic if and only if they are isomorphic. 
Every quasigroup is isotopic to a loop, that is, a quasigroup with an identity 
element, and we derive all further results for loops. 

The concepts of coset and normal divisor may be defined for loops without 
any assumption of associativity(*). Then a subloop § of © is a normal divisor 
of @ if and only if §=eI where e is the identity of G and T is a normal di- 
visor of the group G,. Isotopic loops have corresponding normal divisors, and 
loops which are isotopic to simple loops are simple. We also combine the con. 


Presented to the Society, September 13, 1943. Received by the editors June 17, 1943. 

(*) See the paper of N. Jacobson, A note on non-associative algebras, Duke Math. J. vol. 3 
(1937) pp. 544-548, as well as the author’s papers Non-assoctative algebras. 1. Fundamental 
concepts and isotopy, Ann. of Math. vol. 43 (1942) pp. 686-707; The radécal of a non-associative 
algebra, Bull. Amer. Math. Soc. vol. 48 (1942) pp. 891-897. 

(?) Cf. A. Suschkewitsch, On a generalization of the associative law, Trans. Amer. Math. 
Soc. vol. 31 (1929) pp. 204-214. A suggestion of isotopy theory is given there and the assump- 
tion is made that the right multiplications form a group. Then he shows that the quasigroup 
is isotopic to a group. See also D. C. Murdoch, Quasigroups which satisfy certain generalized 
associative laws, Amer. J. Math. vol. 61 (1939) pp. 509-522; ibid., Structure of abelian quasi- 
groups, Trans. Amer. Math. Soc. vol. 49 (1941) pp. 392-409; H. Griffin, The abelian quasigroups, 
Amer. J. Math. vol. 62 (1940) pp. 725-737. 

(*) In this connection see B. A. Hausman and O. Ore, Theory of quastgroups, Amer. J. 
Math. vol. 59 (1937) pp. 983-1004. These authors determined associativity conditions in order 
that the classical coset decompositions be valid. A similar idea was used by D. C. Murdoch, 
Note on normality in quasigroups, Bull. Amer. Math. Soc. vol. 47 (1941) pp. 134-138 but not 
by G. N. Garrison, in his Quasigroups, Ann. of Math. vol. 41 (1940) pp. 474-487, who also 
discussed the concept of normal divisor but used the finiteness of the system in an essential way. 
The definition we shall give here for loops seems to be much more natural than these others 
and reduces to the classical definition in the case of groups. 
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cepts of isotopy and homorphism to yield a new concept of homotopy(‘) of 
loops. Then we show that if a loop © is homotopic to a loop @’ it is homo- 
morphic to a loop which is isotopic to G’. 

2. Permutation groups. A transformation on a set @ is defined to be a 
correspondence 
S: g—gs 
on & to G. If the correspondence is one-to-one it is usually called a non- 
singular correspondence. It will be convenient here to refer to nonsingular 
transformations as permutations. This is standard usage in the case where G 
is a finite set and there seems to be no reason why the terminology should not 
be extended to the general case. 

The set = of all permutations of © forms a group with respect to the oper- 
ation of product defined by g(ST) =(gS)T. It is customary to call this group 
the symmetric group. 

If T’ is any set of permutations of G, and g is any element of G, we define 
gI to be the set of all elements gS for S in . When I is a subgroup of 2 it 
contains the identity transformation J and gI contains gI =g. 

We call a subset § of @ a transitive system for a permutation group [ 
on @ if 4S is in § for every h of $ and S of I and if, for every & and k of $, 
there is an S in T such that k=4S. Then we have the following simple lemma: 


Lema 1. The transitive systems $ for a permutation group T on © are the 
sets $=hoI for ho any element of H. 

For if § = hI then every element of H has the form h=hoS, (oS) T =ho(ST) 
is in $. Also h=hoS, k=hoT implies that k=(hoS-!)T=ho(S-'*T) for S-'T 
in I’. Conversely if $ is a transitive system and hp is in © there is an S such 


that k=hoS, 
We shall call a permutation group I a transitive or intransitive group ac- 


cording as @ itself is or is not a transitive system for I. When [ is transitive 
we have gI' =@ for every g of G. If T is intransitive every gI is a proper sub- 


set of G. 
3. Quasigroups. A set © is said to form a quasigroup if a product 


is defined to be a unique element of @ for every a and x of @ such that the 
equations 


(1) a-x=b5, y:a=b 


have unique solutions x and y for every a and 6 of ©. Then every x of G de- 
termines a transformation 


(*) This term has a very different meaning in topology but its use here and in the theory of 
algebras seems very desirable. 
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R:: 


of @. If cR.=gR. then a-x=g-x and the second uniqueness hypothesis of (1) 
implies that a=g. Thus every R, is a permutation of @. We shall call R. a 
right multiplication of © and shall designate by G, the set of all right multi- 
plications of G. In general G, is a subset but not necessarily a subgroup of 2. 
In a similar fashion we determine the left multiplications 


g—>gR. = 


L:: gel. = 


of @. They are also permutations. 

A quasigroup now consists of a set & of elements g and a corresponding 
set @, of permutations R, on @. The left multiplications L, are then deter- 
mined transformations and will be permutations if it is true that gR.=gR, 
only if x =y. This requires that the correspondence g—R, shall be one-to-one 
and that if two distinct permutations of @, be applied to the same element of 
© the results shall be distinct. Conversely when these conditions are given @ 
will be a quasigroup. For g-k=gR, x=bL;', y=bR;" are all uniquely deter- 
mined elements of © when every R, and L, is a nonsingular transformation. 

Note that gS=gT is possible for distinct permutations S and T on © but 
that if xS=xT for every x of G, where S and T are independent of x, then 
S=T. Note also that in the case where @ is a finite set of m elements @ - - - én 
we are stating that G consists of these elements and m permutations R; - - - Ry 
on them such that if x;R;=x;R; for any i then j=. 

4. Isotopy. Two quasigroups @ and G™ are said to have the same-order 
if there is a one-to-one correspondence g—g,) between them. In our general 
considerations there will be no loss of generality if we regard all quasi- 
groups of the same order as consisting of the same elements and thus differing 
only in their right multiplications. 

Any quasigroup @ now consists of a set I? of elements and a correspond- 
ing set G, of right multiplications. Let @ be a second quasigroup consisting 
of the same set of elements and a set G, of permutations R, on Mt. Then we 
shall say that © and @ are isotopic if there exist permutations A, B, C 
on M such that 

(0) 


(2) =AR,5C. 


This is equivalent to the definition of product (g, x) in @™ in terms of that 
in © by 


(3) (g, x) = (gA- xB)C. 
Then (x, g) =(xA -gB)C and 
(4) = BL.C. 


Note that if @ is given then the transformations R, and L, are products of 
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permutations and are permutations. Thus if @ is a quasigroup and we define 
© by (3) the resulting isotope is a quasigroup. As in the theory of nonasso- 
ciative algebras the relation of isotopy is an equivalence relation. 

5. Principal isotopes. Two quasigroups @ and @ of the same order are 
said to be isomorphic if there exists a permutation P on their common set 
of elements such that (gP, xP) =(g-x)P. Then (gP-!-xP-')P =(a, x) and, if 
we define A =P-', we see that the isomorphism of two quasigroups may be 
regarded as the instance 


of isotopy. Moreover we have 


LemMaA 2. Every isotope © of a quasigroup © is isomorphic to a principal 
tsotope G defined by 
(0) (0) 


(6) R,z = Lz BL za. 


For let RY =SR.rU. Then is isomorphic to defined by 
R® = UR“ U-!=ARzzg where A=US, B=UT. The second formula of (6) 
follows from the consequence (4) of (2) in the case C=J. 

Principal isotopes are more convenient to study than general isotopes and 
we shall use Lemma 2 frequently in our proofs. 

6. Loops. A quasigroup © is said to have an identity element e if 
e-x=x-e=x for every x of G. Then xR,=xL,=<x for every x, and e is that 
unique element of @ such that R,=L, is the identity transformation J. It 
will be very convenient to have a special name for such quasigroups and, as 
we said in our introduction, we shall call them loops. 

We shall show that every quasigroup is isotopic to a loop. A construction 
of ail loops will then yield quasigroups from which all quasigroups may be 
constructed. Thus it will be at least reasonable to restrict all further study to 
that of loops. 


THEOREM 1. Let f and g be any elements of a quasigroup © so that there exists 
a unique h in © such that f=g-h. Then f is the identity element of the loop which 
ts the principal isotope of © defined by (6) with 


(7) B=L,. 


Conversely if a loop @ is a principal isotope of © it is defined by (6) and (7) 
for elements 'g and h of © such that f=g-h is the identity element of ©. 


For if f=g-h=gR,=hL, we may write g=fA,h=fB for A and B of (7). 
Then Rf =AR;p=AR,=I, Conversely if is de- 
fined by (6) and f is the identity element of @® then Ri? =AR,;,=L!” 


= BL;,=I. Put g=fA and h=fB to obtain (7) as well as f=gA-'!=gRi=g'‘h. 
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7. Groups. A quasigroup @ becomes a group when we assume the asso- 
ciative law x-(a-y)=(x-a)-y for every a, x, y of G. Then aR,L.=aL-Ry. 
Hence @ is a group if and only if R.L,=L,R, for all right and left multiplica- 
tions of @. It is known that the group property implies the existence of an 
identity element, that is, an associative quasigroup is always a loop. We also 
have 


Lemma 3. A loop © is a group if and only if G, is a group. Then © and G, 
are isomorphic. 


For x—R, is a one-to-one mapping of © on G,. If G is a group we have 
(a-x)y=aR.R, =a: (x-y)=aR..,, RzRy=R:.y, our correspondence is an iso- 
morphism, @, is a group. Conversely let © be a loop with identity element e¢ 
so that 


(8) eR, = eL, = & 


for every x of G. If G, is a group we have R.R,=R, where g is a uniquely 
determined element of G. For the permutations S of G, are in a one-to-one 
correspondence S—g with the elements g of G such that S=R,. But eR,-eR, 
=eR,-y=eR,R, =eR, = g=x-y, RzRy=R:z-y, g-R, is an isomorphism of the 
group G, and @. 

Note that in the case of an arbitrary loop we have x-y=eR,-eR,=eR.Ry 
=g=eR,, but we do not have R.R,=R,. For R.R, may not even be in G,. 

We may now prove the important theorem: 


THEOREM 2. A loop is tsotopic to a group if and only if it is isomorphic to 
the group and thus is itself a group. 


For let © be a loop isotopic to a group @ so that we may assume @ 
to be a principal isotope of @ defined by (6), (7). Let e be the identity ele- 
ment of G, so that, if x-y=e, we have R,.y=R.R,=I, Ry=Rz'. 
But and 


Similarly LO But then every R® is commuta- 
tive with every L©, G is associative. By Lemma 3, © and G, are isomorphic, 
=G,, G and are isomorphic. 

8. Subloops. The right multiplications R, of the elements h of any subset 
§ of a loop G generate a group of transformations which we shall designate 
by §,. It consists of all finite products of positive, negative and zero integral 
powers of the transformations R, for 4 in $. The left multiplications L, also 
generate a transformation group §y, and the right and left multiplications 


512 A. A. ALBERT 1 [November 


together a group §,. All of these groups are subgroups of the group G, de- 
fined as above for 5=G. 

A subloop of a loop @ is a subset § which forms a loop with respect to 
the defining operation of @. It is sufficient that § form a quasigroup, that is, 
that § contain all products h-k and the solutions x and y of x-h=k, h-y=k 
taken when A and k are in §. For then § will contain the solution e of x-h=h. 
Thus the identity element of a loop © is the identity element of all of its subloops. 


Lemma 4. A set $ is a subloop of © if and only if e,=§. 


For let § be a subloop of © so that contains eJ =e. The elements of §, 
are products of transformations Rs, Ls, Ri, Li’, and if k=eS is in H, then 
e(SRi) =k-h, e(SL,) =h-k, are all in 
Hence e§, is contained in every h=eR, is contained in eG,, 
Conversely if we have eR,R,=h-k and the solutions x=eR,R;"', 
y=eL,Lr' of x-h=k and h-y=k in §. Then § is a subloop. 

It should be observed that the order of a subloop § of a finite loop G need 
not divide the order of G. For example let G consist of e =e, é2, €3, €4, &s, and 
let R., = R; be defined by R,.=J, Re=(12)(345), Rs= (13) (254), Ra=(14)(235), 
Rs=(15)(243). Evidently e;-e=e;R,=e;, e is the identity ele- 
ment of @. We compute €2°€4=€3, €2°€5=€4 SO that 
L2(12)(354). By symmetry L3=(13)(245), L4=(14)(253), Ls=(15)(234) and 
© is a loop. Its four subloops (e, e;), defined for i= 2, 3, 4, 5, are all groups of 
order two. 

9. Cosets. Every subgroup I of G, defines subsets xI of G which we shall 
call cosets of @ relative to I’. They are transitive systems of ! by Lemma 1. 
Thus every element of © is in one and only one coset and the sets xI provide 
a coset decomposition of G. 

If § is a subloop of a loop G we shall call the cosets x, the right-hand 
cosets of & relative to §. Similarly, we shall call the cosets x, the left-hand 
cosets of @ relative to H, and the cosets x$, the two-sided cosets of © rela- 
tive to 

The sets x consisting of all elements x-h for 4 in the subloop § do not, 
in general, have the property that each element is in one and only one such 
set. For if x-k=y-k then y=xR,R7"' is in x, but may not be in x§. We 
shall not call such sets cosets. However, our definition of normal divisors will 
have the consequence = = xO. 

With every subgroup I of @ and corresponding coset decomposition of G 
we associate the set I* of all transformations S of G, such that xS is in xT 
for every x of G. Then xI™ is contained in xI’. But clearly T is contained 
in «I is in xI'*, xf! for every x. Also we have 


Lema 5. The sets I'* are subgroups of G,. 


For if H and K are in I* and x is in G we have y=>xH=<xS where S is 
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in Then yK=yT where T is in (eH) K=(xS)T, x(HK) =x(ST)=xU 
for U=ST in I, HK is in T'*. Also z=xH-'! has the property zH=zS where 
Sis in if His in T'*, x=2S, x! =zST=<aI since is a group, if S is 
in I. Then g=xH— is in and H- is in I'*. 

Note that every subloop § of @ determines a unique subgroup §; of G, 
such that eH,*=e$,=$=e$, by Lemma 3. However §, is not necessarily a 
subgroup of §,*. 

10. Normal divisors. A homomorphism of a loop @ on a loop ® is a 
mapping g—g’ of G on G’ such that every element of G’ is the image g’ of 
an element g of G, and (x-y)’=x’-y’ for every x and y of G. The mapping 
x—«x, of G on the quotient set G/H of all cosets x§, satisfies the first of 
these properties. We shall call a subloop § of & a normal divisor of & if G/H 
is a loop and x—>x, is a homomorphism of © on G/§. 

The hypothesis that G@/§ is a loop implies that either of the equations 


(10) = wGH,, xD, = wH,-xH, 
is true if and only if y$,=w,. The homomorphism hypothesis implies that 


(11) = Dp 
and thus defines the product operation in G@/. Conversely (11) and (10) im- 
ply that G/Q is a loop and that x—x§, is a homomorphism of © on G/®. 
For $ =e, and by (11), we have §-yH,=yG,, xH,-H=xGH,, H is the identity 
element of Also gO, - yD, = (g-¥)H, = and xH, - gH, = (x - g)H, = are 
satisfied by y=bL;>'1, x=bR;z' and the corresponding coset solutions are 
unique by (10). Then @/Q is a loop as desired. 

Note that if 
(12) 
for every x, and (11) becomes 
(13) = (x-y)I. 
Also we may use (11) to write (10) in the form 
(14) (x-y)T =(w-y)T, = (y-w)P 
if and only if x! =wI. We are now in a position to prove the fundamental 
theorem: 


THEOREM 3. A subset $ of a loop © with identity element e is a normal 
divisor of © if and only if S=el where T is a normal divisor of G,. Then 
I'*=§,* is also a normal divisor of G, and 


(15) aD = x, = = xD, = 2H, 


for every x. 
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For let $ be a normal divisor of G and define [=§;* so that we have 
(12), (13), (14). Since every permutation of © is a finite product of permuta- 
tions R,, R;=', L,, Lz! the group I will be a normal divisor of G, if we can 
show that 7-'IT is contained in T for T=R,, R;>', L,, L='. Now for every 
x and y of @ and S of T we define w=xR,SR;"' so that xR,S=wR,, 
(x-y)T =(w-y)T. By (14), x! =wI, w is in xl. But our definition of $7 
implies that R,SR;>' is in IT, R,IT'R;" is contained in Similarly define 
w=xL,SL;" and obtain (y-x)S=y-w, (y-x)T=(y-w)T, wf w is in 
and L,I'Z;" is contained in T. We next put s=xR;' and have z-y=x, 
zS-y=(z-y)U where U is in T by (13). Then xR>'SR,=<xU is in xT and 
R;'TR, is contained in I. A similar argument with z=yL>" yields x-2S 
=(x-2)U, yLz'SL,=yU and implies that Z>"TL, is contained in 
is a normal divisor of G,. 

Conversely let §=eI where I is a normal divisor of @,. If S and'T are 
in T then eS-eT =eSR,ry=eR,pU=eTU where U is in Tl, TU is in I. Also, 
eS:y=eT implies that eSR,=eR,U =eT, y=eTU— is a unique element of §. 
Similarly x-eS=eSL,=eL,U=xU=eT has the unique solution x=eTU- 
in $. Thus © is a subloop of G. Now xS-yT=xSRyr=xRyrU=(x-yT)U 
=yTR,.U=yL,VU=(x-y)VU where V and U are in I. Hence xI'-9T is 
contained in (x-y)I’. But if S is in T we may determine U=R,SR;" and 
have (x-y)S=xR,S=xUR,=xU-y, (x-y)T is contained in xI'-yI and we 
have (13). If (x-y)'=(x-w)T then yL.T But =TL.and =u 
as desired. Similarly (y-x)!'=(w-x)l and yR,T=wR.T, yl =wI. Now 
xS=eL,S=eUL,=hL.=x-h where h=eU is in § and so xI is contained in 
x. Also x-h=x-eS=eSL,=eL,U=xU for U in T, x§=<T. It follows that 
contains every Ry. Similarly contains 
every L,, contains §,, xI'* =xI contains x$, which contains x§ This 
gives (15). But x§,=xI and [*=$7, xl’ =x; and we have (13) and (14) 
for T' replaced by §,*. This proves our theorem. 

It should be observed that there may be several normal divisors I of G, 
such that $=eI is the same normal divisor of G. Indeed §6=eI =eA if and 
only if $ is a transitive system of both T and A. 

As an expected consequence of our definitions we now have 


THEOREM 4. Let g—g’ be a homomorphism of a loop © on a loop &' and H 
be the set of alk elements h of © such that h’ is the identity element e’ of ©’. Then 
© is a normal divisor of © and the correspondence x$}-—>x' is an isomorphism 
of the quotient loop @/H and G’. 


For if kh and & are in § we have (h-k)’=h'-k’ =e’ and h-k is in §. Also 
x-h=k implies that x’-h’ =k’, x’ =e’, x is in Similarly h-y=k implies that 
y is in § and that § is a subloop of @. The elements y of x$, are products 
of x by a finite number of elements h of § with some order and grouping. 
Since every h’=e’ we have y’=x’ no matter what the order and grouping. 
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Conversely if y’=x’ we may write y=x-z for z in @ and y’=x’-2’=x’. But 
@’ is a loop and 2’ =e’, zis in §, y is in xO. It follows that x, Sx and, since 
=xH,=xH, then 


= = xD, = xO. 


Moreover x’=y’ implies that x$,=y, so that the correspondence x$—<x’ 
is one-to-one. We compute [(x-h)-(y-k) =x’ -y’=(x-y)’ for 
every hf and k of § and see that x<$-yO=(x-y)H. Conversely (x-y)-h=x-z 
has a solution z in © which may be written in the form z=y-w for w in ©. 
Then [x-(y-w) ]’=x'- (y-w)’=x’-(y'-w’) =[(x-y)-h]’ =x’ -y’,  y’=y'-w’, 
w’ =e’, wis in §, (x-y)$=xH-yH. This proves that xH-—>x’ is an isomorphism 
of G/§ and the loop @’. Then G/Q is a loop and § is a normal divisor of G. 
11. The multiplications of quotient loops. If $ is a normal divisor of G 
and '=§,, the elements of @/§ are the cosets xI. The elements of G,/T 
are the cosets ST = I'S for S in @,. Each such coset defines a transformation 


(16) gl — (gP)(ST) = (g5)I. 


But if aST =gST we have a! S=gI'S, aI’ =gI. Hence (16) is a permutation 
of G@/S. We may identify the permutation (16) with ST and have shown that 
the elements of the quotient group @,/I' may be regarded as being permuta- 
tions of the quotient loop G/§. 

The right multiplications of G/§ are the permutations 


that is, the permutations R.I of @,/T. Similarly the left multiplications of 
@/H are the permutations L,I. The group G, is generated by the right and 
left multiplications and thus G,/T is generated by the R,T and L,I. But these 
permutations also generate the group of permutations for @/§. 


THEOREM 5. Let D be a normal divisor of © so that T=G* is a normal di- 
visor of G,. Then (@/H), = G,/T. 


12. Finite loops. In our proof of Theorem 3 we showed that the hypothesis 
«xI'-yI =(x-y)T implies that is contained in for every S which is 
either a right multiplication or a left multiplication. In the case where @ is , 
finite so is T and a comparison of orders then yields STS=Tl, !'=STS-'. 
Indeed it is true that in the finite case if we have ST'SSTI, for elements S 
which generate G,, the group I will be a normal divisor of G,. Thus we have 


THEOREM 6. Let $ be a subloop of a finite loop © and define x'=xH, for 
every x of G. Then if (x-y)'=x'-y’ for every x and y of & the quotient set G/H 
ts a loop and $ is a normal divisor of ©. 


We may also prove 
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THEOREM 7. The order of a normal divisor $ of a finite loop © divides the 
order of 


For if $ is a normal divisor of @ we have seen that every element x of G 
is in one and only one set xO. If x-h=x-k then h=& so that each coset xD 
has exactly as many elements as does §. Then the order of © is the product 
of the order of § and the order of G/§. 

This result evidently implies that if a finite loop © is homomorphic to a 
loop @’ the order of @’ divides the order of G. 

13. Simple loops. A subgroup I of G, defines a proper subset eI’ of © if 
and only if I is intransitive. As in the theory of groups we call a loop simple 
if its only proper normal divisor is the identity group. Then we have 


THEOREM 8. A loop © is simple if and only if the group G, has no intranst- 
tive normal divisor except the identity group. 


Every isotope of a loop @ is isomorphic to a principal isotope G@ of ©. 
If the original isotope is a loop so is G@“ and we may let f be the identity ele- 
ment of @. Theorem 1 implies that @ <G,. The relation of principal 
isotopy is symmetric and thus @, = @{. We pass to a loop @“ isomorphic 
to G@® and defined by 


(1) (0) -1 (0) 


with C= Then hase as its identity element. Moreover = CGC-1 


= G° =G, since C is in G,. The application of Theorem 3 yields 


THEOREM 9. Every loop isotopic to a loop © is isomorphic to a loop having 
precisely the same normal divisors as ©. 


Note that we are not stating that the normal divisors eI!’ of © are iso- 
morphic or even isotopic to the normal divisors of G. They merely consist 
of the same elements. If §=eI then multiplication for § in G is defined by 
y:h=yR, and in G@™ by (y, h) =yCARiscC—. It does not appear necessary 
for § in G and in G, to be isotopic. 

As a consequence of Theorem 9 we have 


THEOREM 10. All loops isotopic to a simple loop are simple. 


14. Centers. The center € of a loop G is the set of all quantities c of © 
such that the commutative and associative laws for products hold whenever 
c is one of the factors. The condition x-(y-c)=(x-y)-c is equivalent to 
R.L.=L.R. for every x, and c-(x-y)=(c-x)-y yields L-R,y=R,L, for every c. 
But x-c=c-x is equivalent to L-=R,. Hence every R, is in the center A of 
the group G,, every c=eR,, C€=eA. 

Suppose now that H is in A and that c=eH. Then x-c=cL.=eHL,=eL,H 
=xH, H=R.,. Similarly c-x=cR,=eHR,=eR,H=xH, L.=R,. We also have 
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R.L:=L:R,. so that x-(y-c)=(x-y)-c and similarly c-(x-y)=(c-x)-y. But 
then R,..=R,R.=R.R,y=R.., and (x-c)-y=x-(c-y), c=eH is in €. Hence 
€=eA. : 

The correspondence c>H=R, is one-to-one since if H is given then 
c=eH. Also eH -eK =eHR,..x=eHK for every H and K of A. Thus we have 


THEOREM 11. The center © of a loop @ with identity element e is isomorphic 
to the center A of and €=eA. 


If G and @™ are loops which are principal isotopes the group G, = @” 
and the centers of G and @™ are isomorphic. By Lemma 2 we have 


THEOREM 12. Isotopic loops have isomorphic centers. 


15. Homotopy. While it has been convenient to define isotopy for alge- 
braic systems consisting of the same elements it is not, of course, necessary. 
Thus we may define two multiplicative algebraic systems © and @’ to be 
isotopic if there exist nonsingular mappings A, B, C of G on @’ such that 


(17) = (x-y)C. 


The concept of homomorphism may now be generalized. We call two 
mappings A and B of a set G on a set G@’ = GA = GB equivalent if there exists 
a nonsingular transformation N of @ and a nonsingular transformation S of 


@’ such that 
B= NAS. 


Then we shall say that a multiplicative system © is homotopic to a multi- 
plicative system @’ if there exist equivalent mappings A, B, C on © to G’ 
such that @’=GC and (17) holds for all x and y of G. Then A=NCS, 
B=QCT, xNCS-yQCT =(x-y)C. Define (x’, y’)=x'S-y’T for every x’ and 
y’ of @’ and (x, y) =xN-'-yQ-' to obtain (xC, yC) =(x, y)C. The argument 
is readily reversed and we see that © is homotopic to WG’ if and only if an isotope 
of © is homomorphic to an isotope of ©’. 

We note now that @ is itself homomorphic to an isotope of @’ if there 
exists a mapping P of G on G@’ and nonsingular transformations S, T, U of G 
such that @’ = GP and (xP, yP) =(x-y)P where (x’, y’) =(x’S-y’T)U. This is 
equivalent to (17) with A=PS, B=PT, C=PU-'. Then A=C(US), 
B=P(CT). Hence (17) implies that © is homomorphic to an isotope of G’ 
if there exist nonsingular transformations S and T of G’ such that A =CS, 
B=CT. However such transformations may not exist. 

For example let @ be any group of at least three distinct elements and C 
be any singular mapping of © on a set @’ of: at least two distinct elements. 
Then there exist distinct elements g, h, k in @ such that gC=hC, gC¥kC. 
The transformation N on © which interchanges h and & and leaves all other 
elements of & unaltered is nonsingular and we define a homotopy of G on @’ 
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with A =B=WNC. This defines the product operation in @’ and is such that 
gN=g, gNC=gC#¥kN=hNC. However if A=CS we could conclude from 
gC=hC that gCS=hCS, gA =hA, a contradiction. 

Let us observe that if we designate the right and left multiplications of 
©’ respectively by R&} and then (17) is equivalent to 
yBL{} =yL.C and thus to 
(18) ARs =RC, Bla 

We assume that @ is a loop homotopic to a loop @’ and let e be the identity 
element of @. Replace x and y in (18) by e to obtain 

C = ARs = Blu. 

Since @’ is a loop both R{ and L) are nonsingular transformations on @’ 
and @ is homomorphic to the isotope G’’ of G’ defined by (x’, y’) =x’S-y’T 
where S-!= RY, T-!=L{. Since (xC, yC) = (x-y)C we see that eC is the iden- 
tity element of @’’ and @”’ is a loop. We have proved 


THEOREM 13. Let a loop © be homotopic to a loop G’. Then © is homomorphic 
to a loop isotopic to ©’. 


If S is the normal divisor of G which is mapped by C on eC the quotient 
loop G/Q is isotopic to G’. We note also that we have the almost immediate 


Coro.iary. Let a group © be homotopic to a loop G’. Then © is homo- 
morphic to &' and @’ is a group. 


For by Theorem 13 the group © is homomorphic to @’’, G’”’ is a group 
isotopic to G’, G’’ is isomorphic to G’ by Theorem 2. 
16. Special loops. A loop G may be called alternative if the conditions 


hold for every g and x of G where g-g-' = g~'- g=¢ is the identity element of G. 
Then it is known() that every two elements of @ generate a subloop which 
is a group. 

An example of such a loop is given by the nonzero elements of the non- 
associative Cayley-Dickson division algebra of order eight over the field of 
all real numbers. This loop contains a finite loop © of sixteen elements. It 
consists of eight basal elements e:=1, é2, - - +, and their negatives and 
e;(e,¢x) = + (e,€;)ex. The algebra is a central simple algebra and thus the center 
€ of the knot G consists of the group of two elements 1, —1. Evidently the 


(®) See R. Moufang, Struktur von Alternativkorpern, Math. Ann. vol. 110 (1935) pp. 416- 
430, for a discussion showing that this set of postulates imply that every two elements of the 
loop generate a group. The postulates are given also in the Hausman-Ore paper on page 993. 
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quotient loop G/€ is the abelian group of exponent two and order eight. 
This is important as an example of a case where § is a group and @/QG is a 
group but @ is a loop which is not associative. 

The theory of alternative loops is only one of many interesting questions 
about loops suggested by topics of the theory of nonassociative algebras. For 
example, the construction of new simple algebras recently(*) given by the 
author yields, by the use of basal units, a construction of what may be called 
a crossed extension % of every finite loop &, in which & is a normal divisor 
of %, and the quotient loop $/Y is a prescribed permutation group on & leav- 
ing its identity element unaltered. The study of other properties of such loops 
should prove to be of considerable interest. 


(°) Non-associative algebras. 11. New simple algebras, Ann, of Math. vol. 43 (1942) pp. 708- 
723. 
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PERMANENT CONFIGURATIONS IN THE 
n-BODY PROBLEM 


BY 
CARL HOLTOM 


1. Introduction. The problem of two bodies for spheres, homogeneous in 
concentric layers and finite in size, was first solved in a geometrical way by 
Newton [14]() about 1685. Euler [4] gave the first detailed analytical solu- 
tion of the problem in 1744. In 1772 Lagrange [8] gave four particular 
solutions of the three-body problem in his prize memoir. All solutions of the 
two-body problem and Lagrange’s particular solutions of the three-body prob- 
lem belong to that special class called permanent configurations(?), which we 
we shall define presently for the case of -bodies. 

Consider m free particles in space which attract each other along lines 
joining them according to any function of the distance. If all the masses are 
projected with initial velocities, and thereafter are acted upon only by the 
force of attraction, their orbits will be space curves. The problem of deter- 
mining the positions and velocities of the bodies at any later time is the 
problem of bodies. A configuration of n bodies is said to be permanent if, as 
the masses move in their respective orbits, the ratios of the mutual distances re- 
main constant. Such a configuration may change in size but not in shape. 

The literature on permanent configuration problems contains two methods 
of approach, (1) that of establishing continuity between the configurations for 
n—1 and n-bodies, and (2) that of characterizing the configurations for any n 
by the necessary and sufficient conditions which the n(n—1)/2 mutual dis- 
tances must satisfy. Collinear configurations were discovered for the case n =3 
by Euler [5] and for any m by Lehmann-Filhés [9] and F. R. Moulton [13], 
the latter using the first approach. Noncollinear configurations for the case 
n=3 were discovered by Lagrange [8], who also treated the collinear case. 
Dziobek [3] discussed the general approach (2) and arrived at some results 
for the case » =4, which case was treated in detail by MacMillan and Bartky 
[12]. W. L. Wiliams [16], applying the sage method, considered the non- 
collinear case m:*5. Some noncollinear solutions of the ”-body problem have 
been found by Hoppe [7], Andoyer [1], Longley [10] and Emilia Breglia [2], 
but all have sorae element of symmetry. 

Space per:nanent configurations are the scarcest and have the undesirable 
property that the motion of all bodies is either toward their common center 


Presented to the Society, September 13, 1943; received by the editors March 8, 1943 and, 
in revised form, July 8, 1943. 

(*) Numbers in brackets refer to the bibliography at the end of the paper. 

(?) MacMillan [11, pp. 71-72, 75-78]. 
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of gravity, in which case the configuration lasts for a finite time, or in the 
the opposite direction, in which case the configuration is expanding without 
limit(*). In fact it can be shown that all permanent configurations are either 
space configurations of the dilating without rotation type or they are 
planar(*). We shall treat only the plane configurations, and in particular only 
those rotating without dilation, for if the law of attraction is the Newtonian 
law and there exists a permanent configuration in which the m bodies revolve 
in concentric circles about the common center of gravity, the bodies may also 
move in similar conics with the common center of gravity as foci(*). 

In most cases mentioned above it has not been necessary to restrict the 
masses Or mass ratios, but in this paper we shall, upon applying approach (1), 
make certain restrictions which are sufficient, though they may not be neces- 
sary, to insure continuity. 

2. The equations for a permanent configuration. The problem of bodies 
belongs in the field of differential equations. If we require that the ratios of 
the mutual distances remain constant, the problem becomes one of permanent 
configurations and belongs in the theory of implicit functions. As pointed out 
in §1, we need require only that the motion be circular, that is, the distances 
themselves remain constant. 

Let the origin be taken at the center of mass of the system, and let w 
denote the constant angular velocity of the m masses about the origin. The 
equations for a permanent configuration are(*) 


(2.1) i= 1,2,---,m, 
j= 
where A =w?, (743)? = (x; and the prime on the summation de- 
notes the sum for all 7 except j equal ¢. It is a problem now in implicit function 
theory to solve equations (2.1) for the 2% coordinates x; and y; in terms of 
the parameters m; and i. 

3. Existence of a point of libration. The first step in establishing continu- 
ity between the solutions for »—1 and m bodies consists of proving the exist- 
ence of at least one point of libration for the case of n—1 bodies, and this 
section is devoted to that purpose. 

Suppose that for a set of positive values +++, A, and for 
n2=4, the first »—1 equations of each set (2.1) have a solution (x?, y?) 


i— = 0, 


(*) The space configuration for n =4 was first noted by Lehmann-Filhés [9]. Cf. also Mac- 
Millan [11, p. 74], and Wintner [17, p. 279]. 

(*) Wintner [17, pp. 287 ff.]. 

(®) Wintner [17, p. 300], or MacMillan [11, p. 74]. 

(*) Wintner [17, p. 302]. 
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(é=1, 2,- +--+, m—1) and that these points P?=(x?, y?) are not collinear. If 
an infinitesimal mass m, be added to the system at the point (x,, y,) then 
x, and y, must satisfy the equations 


— 


= 0, 


(Yn — 
2(Xn, Yn) x)? + (yn ye)? 


The subscript » may be omitted, and the two equations 
(3.2) o1(x, y) 0, o2(x, y) = 0, 


are the equations of two algebraic plane curves which we shall call C; and C, 
respectively. Since the —1 points P? satisfy both equations (3.1), both 
curves pass through each of the P?. It will be shown that C, and C; have at 
least one real intersection other than the n—1 points P?. 

In order to determine the behavior of the curves in the neighborhood of 
any one of the points P?, we may write the equations for C; and C; in the form 


Gi(x, y) = (x — + [(« + (y— ys) y) = 0, 
G.(x, y) = (y- + ai) + (y— ys) fal, y) = 0, 
where 
Kane = 
1\%, = 
(y y;)m; 
[(x x)? + (y 
Since fi(x?, yf) =0, and fo(x?, y?) =0 from the ith equation in each set (2.1), 


it follows that fi(x, y) and fe(x, y) may be expanded in powers of x—x? and 
y—y?, vanishing for x =x?, y=y{. Thus 


— Ax, 


(3.3) 


fa(x, y) 


(x — 
+ Alx— x) 


o1(x, y) 


y= 
— + (y — 
+ — + Bly — 


$2(x, y) = 


(3.1) 
— Ayn = O. 
(3.4) 
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The slope of C, is 


and the slope of C; is 


On evaluating these slopes at (x?, y?) we find that C; passes through P? tangent 
to x =x? and C; passes through P? tangent to y=y?. 
We shall determine next on which side of the tangent line the curve lies in 
the neighborhood of P?. Let x—x? =r cos 0, y—y?=r sin 9 then equations (3.2) 
and (3.4) become, in polar coordinates, 


¥i(r, 0) = (m,/r? + Ayr) cos + By sin@+--- =0, 
0) = (m;/r? + Aor) cos + By sin@+--- =0. 


From (3.3) we note that of:/0y=0f2/dx, and hence Bi=A:z. Since m; 
and rf are positive we may choose 7 so small that both (m;/r°+A:) 
and (m,/r'+B,) are positive. Then as @ varies from 0 to 27, vilr, 4) 
changes sign at @=arctan [(m;/r?+A1)/(—B,)], and yz changes sign at 


Y Y 


Fic. la 


6=arctan [—B,/(m;/r?+Az)]. Hence the curvature of both C; and C, at 
P? depends only upon the sign of B,. The behavior of the curves at any P? 
is either that of Fig. 1a or Fig. 1b. In case B,=0 the behavior of the curves 


y 

iX=X? 
| \X=X? i 
| 
: Ci 
C2 
A 
xX xX 
|By>O 
B,<O 
Fic. 1b 
| IG 
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is governed by the sign of the first nonzero quantity in the sequence 
= O%f2/dxdy, = 


and the cases pictured in Figures 1a and 1b are all that can occur. In order 
to prove that there is at least one real solution of (3.2) other than the P?, 
let us denote by P® that P? which has the largest ordinate. In case B, is nega- 
tive at P® the curves are as shown in Fig. 2, and we shall prove that there is 


Fic. 2 


at least one real intersection of C; and C; in that section of the plane for which 
y>y and x>x°. 

By choice of notation there are no P? in this region and suppose that there 
is no other intersection. Let P=(x, y) move from P® to + along y=y’. 
Along this ray P meets C; first, namely at P®, and then meets C, later, since 
¢: varies continuously from + to — © as is easily verified in (3.1). Simi- 
larly, as P moves from P* to + along x =x° it meets C; first and C; later. 
Let this region of the plane be covered by rays leading from P® to «. As P 
moves from P® to © along rays P°D,, P*D:,---, it will meet Cz first 
provided the slopes of the rays are sufficientiy small, while along rays 
P°B;, P°B:, - - +, it will meet C, first provided the slopes of these rays are 
sufficiently large. If there is no intersection of C, and C; in the region there 
is a last ray P*D of the rays P*D; along which P meets C; first and a last 
ray P°B of rays P®B; along which P meets C, first. The areas covered by the 
rays P°D,; and P®B; may (a) overlap, (b) be adjacent, in which case P°D and 
P°B coincide, (c) be separated by a sector BP°D. 


Y 
Bo By B, B 
X=X* 
iD 
D: 
/ D; 
Do 
X 
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Before taking up these three cases let us show that all rays from P® to © 
that lie in the region x>x° and y>~y®* cut both curves at least once. From 
(3.1) it is seen that ¢; and ¢2 are continuous functions of x and y for P within 
this region, and that ¢; and ¢2 each vary continuously from +o to —© as 
P moves from P* to © along any ray from P that lies in the region. Applying 
this result (a) to rays in the sector common to ByP®B and DoP*D, we conclude 
that every ray drawn from P® and lying in this sector, since it meets both 
curves and meets each curve first, must pass through an intersection of C; 
and C:;; (b) to the single ray common to both sectors, we reach the same con- 
clusion as in (a); (c) to rays in the sector BP®D separating those already cov- 
ered, we find that every ray, since it does not meet C, first and does not meet 
C; first but does meet them both, must pass through an intersection of C; 
and C;. In all cases one is led to a contradiction of the hypothesis that there 
is no intersection. 

It is evident that the same proof can be applied in case B, is positive at P°. 
In case there are several such points P® with the same ordinate the proof can 
be applied at any one of them yielding an intersection with ordinate greater 
than y*. Thus we have the following theorem. 


THEOREM 3.1. There exists at least one point of libration at which a zero 
mass m,, can be placed, so that it will, together with the given n—1 positive masses, 
form a permanent configuration. 


4. The solution as m, becomes positive. As shown in the last section there 
is a solution of (2.1) for positive A, m1, m2, +--+, ma, and m,=0. Let this 
solution be Pf. Since equations (2.1) are algebraic equations with coefficients 
functions of m;, the solution functions x? and yf are continuous functions of 
m, as long as the roots are finite and the equations do not have indeterminate 
forms, Consequently, x? and y? are continuous functions of m, if no x° or no y? 
become infinite, or if no pair (x?, y?) becomes equal to another pair (x?, y?). 
Furthermore, the real roots x? and y? of algebraic equations (2.1) with real 
coefficients \ and m,; can disappear only by passing to infinity, or by an even 
number of real roots becoming complex conjugate quantities in pairs. There- 
fore, the problem is to determine as m, varies whether 

(1°) for all finite x? and y? any pair (x9, y?) can become equal to another 
pair (xf, 

(2°) any x? or y? can become infinite, 

(3°) the solution (x?, y?) can ever become imaginary. 

5. The P? remain distinct. This section is devoted to problem (1°) of the 
last section. We shall treat the more general case in which any m; may vary. 

Let the notation be chosen so that 


(5.1) Smid 


where there is at least one inequality in the noncollinear case. Suppose that 


a 
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as m, varies all P? remain finite, and P? approaches P? in a manner (a) such 
that x?x?, and that j<k. In the jth equation (2.1) there is a term involving 
(x? —x?)/(r%)* =a% which becomes negatively infinite. If this jth equation is 
satisfied another term ajm,; must become positively infinite, and accord- 
ing to our notation 1<j. Now a% appears besides only in the /th equation 
and becomes negatively infinite in the term afjm;. If the th equation is satis- 
fied a term afm, must become positively infinite and according to the nota- 
tion »</. Continuing in this manner one arrives eventually at the situation 
in which a term involving af with one of the subscripts 1 becomes negatively 
infinite. But all terms in the first equation are negative except —Ax®, which 
cannot become positively infinite under the hypothesis that the P? remain 
finite. Hence, the hypothesis that P? approaches P? in the manner (a) is false. 

Suppose that P? approaches P? in the manner (b) such that x9=x?. For 
this case let us choose the notation so that 


0 0 0 0 0 


where again there is at least one inequality. In this notation, if P} becomes P? 
and P? becomes P?, our hypothesis is that P$ approaches P? in the manner 
x?=x° and p<qg. There is a term in the (n+ )th equation (2.1) involving 
(y8—y>)/(r8.)* =6%,, which becomes negatively infinite. By an argument simi- 
lar to that of the last paragraph one eventually arrives at the situation in 
which a term involving §2 with one of the subscripts 1 becomes negatively 
infinite. But all terms in the (w+ 1)th equation are negative except —dy?, which 
cannot become positively infinite under the hypothesis that the P? remain 
finite. Hence, the hypothesis that P? approaches P? in the manner (b) is false. 
This completes the proof of the following theorem. 


THEOREM 5.1. As the m; vary and the solution functions x? and y? remain 
finite the P? remain distinct. 


6. The P? remain finite. In order to show that as any m; varies no x? or yf 
can become infinite, let us again adopt the notation (5.1) and suppose that x? 
becomes positively infinite. From the center of gravity equation 


(6.1) > mix, = 0, 


some x? must become negatively infinite, and from (5.1) it follows that x? 
approaches — © and x? approaches + «. Consider the first equation (2.1). 
In order that it be satisfied P? must approach P? in a manner such that 
x?x°, that is, x2 must approach — ©. Now consider the second equation. 
If it is to be satisfied, P? must approach P? in a manner such that x8~x%, 
that is, x$ must approach — ©, Continuing this process we are led to the con- 
clusion that all xf approach — «, hence the hypothesis must be false. 


\ 
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In order to show that no y? can become infinite, we adopt the notation 
(5.2) and suppose that y? becomes positively infinite as m; varies. From the 
center of gravity equation 


(6.2) LX mys = 0, 


and the notation (5.2), it follows that y? approaches — ~, and y? approaches 
+. Consider the (n+1)th, (n+2)th, and so on of equations (2.1) and the 
proof is the same as that in the preceding paragraph with the corresponding 
changes in notation. Thus we have the following theorem. 


THEOREM 6.1. As the m, vary the P? remain finite. 


7. The solution as m, vanishes. Before we take up the question of the 
solution becoming imaginary (3° of §4), it is to be shown that as m, ap- 
proaches zero the equations (2.1) and the solution functions x? and yf remain 
regular, and that there is, accordingly, only one limiting position of the P? 
for the value m,=0. Let a solution for all positive m; and X be P/, and let 
some m; approach zero. 

First, if some Pj approaches some P; in any manner then it must be that 
P; approaches P;_, or Pj,,, for all other possibilities lead to the fact that one 
of the equations cannot be satisfied by arguments used in §§5 and 6. 

Secondly, if P} approaches P},, in the manner x} =x},:, and if the nota- 
tion is that of (5.2) the (m+ 7)th equation (2.1) cannot be satisfied unless 
approaches Pj in the manner x/_;.=x/. Thus approaches P},; in 
the manner xj.1=x/,1 as m; vanishes. But then the (m+j+1)th and 
(x+j—1)th equations cannot be satisfied unless Pj/_. approaches P}_; in 
the manner xj_2=x/_, and P/,: approaches P},; in the manner x/,2=x}41. 
This shifts the difficulty to the (n+j+2)th and (n+j—2)th equations. On 
continuing this process one arrives at the (m+1)th and (2m)th equations 
which cannot be satisfied under the hypothesis it was necessary to make. 

In.case P} approaches P},; in the manner x/ #x},1, then, if the notation 
is that of (5.1), the jth equation (2.1) cannot be satisfied unless P/_, ap- 
proaches P/ in the manner xj/.1~xj as m; vanishes. Thus Pj_1 approaches 
Pj,1 in the manner x}_1#xj,;. But the (j+1)th and (j—1)th equations can- 
not be satisfied unless P/_, approaches Pj_; in the manner xj/2%xj4, and 
Pj,2 approaches Pj,; in the manner x/42%x/,:. This shifts the difficulty to 
the (j—2)th and (j+2)th equations, and on continuing this process one even- 
tually arrives at the ist and mth equations which cannot be satisfied under 
the hypothesis it was necessary to make. 

In the third place, if any x/ or y/ becomes positively infinite as m, 
vanishes, it must be that.xj or y/ becomes positively infinite, for all other 
possibilities lead to the fact that all x/ or all y{ become negatively infinite 
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by arguments used in §6. If the notation is that of (5.1), the jth equation 
cannot be satisfied as x/ becomes positively infinite unless P/ approaches 
Pj_; in the manner x/ #xj_1, that is, x/_; must approach +o. Then the 
(j—1)th equation cannot be satisfied unless Pj/_; approaches P}_, in the man- 
ner xj.1~xj-s, in which case xj/_, approaches + ©. Continuing this process 
one arrives at the conclusion that x/, x7, +--+, x/1, x/ become positively . 
infinite, and by choice of notation xj,1, x/42, +--+, xX. become positively 
infinite. Hence the center of gravity equation (6.1) cannot be satisfied, and 
the hypothesis that x/ become positively infinite is false. 

The proof that y/ cannot become positively infinite as m; vanishes is pre- 
cisely the same as that of the last paragraph with the notation that of (5.2) 
and the argument beginning with the consideration of the (»+/7)th equation 
(2.1). The proof that no x} or y/ can become negatively infinite as m; vanishes 
will read the same as the proof above with the words “positively infinite” re- 
placed by “negatively infinite.” This concludes the proof of the following: 


THEOREM 7.1. Any solution P} of (2.1) for positive m; and \ remains regu- 
lar as any m, vanishes. 


In particular we shall use the following corollary. 


CoROLLARY 7.1. The solution P{ for positive m; and \ remains regular as 
mM, vanishes. 


8. The condition that the P? remain real. Since the solutions of (2.1) are 
continuous functions of m,, it follows that no two solutions which are real 
for m, =0 can ever become complex conjugate solutions for any positive value 
of m, without having first become equal. If a multiple solution of (2.1) is 
impossible for a set of finite positive values of m,, then it is impossible for 
any real solution to disappear by becoming complex or for any complex solu- 
tion to become real. 

The conditions that a set of simultaneous algebraic equations shall have 
a multiple solution are that a set of values of the variable shall satisfy the 
equations, and that the Jacobian of the functions with respect to the inde- 
pendent variables shall vanish for the same set of values. 

Since equations (2.1) are invariant under a rotation of axes, let the axes 
be chosen so that y2=0 in case n=2, and so that »,=0 for 23. It is to be 
shown that for »=2 the first three equations (2.1) are independent for all 
positive values of \, m; and ms, while for n23 the 2n—1 equations obtained 
from (2.1) by omitting the (n+1)th form an independent set for any positive 
X, me, and sufficiently small positive ms, m4, - , Mn. 

Let F; and F,4; denote the left members of (2.1) and let the Jacobian of 
these functions be A,. Furthermore, let the minor obtained by deleting the 
(n+1)th row and (n+1)th column of A, be D, and let »23. A real solution 
P? of (2.1) for positive \, m1, ms, , and m, =0 will remain real as m, 


Aw 
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increases from zero through positive values for which D, does not vanish. 
This will be established when it is shown that D, +0 for sufficiently small 
positive m;,j7=3,4,---,m. 

The following notation will be used. 0F;/0x;=a;;, OF OF 
=0F,4:/0x;=c;;. From (2.1) it follows that, for 17, 

— 23)? — (155)? — ys)? — 
= mi, 


aij 


8.1 


(r:4)5 


ij 
and, for i =j, 
n—l n—1 n—1 
k=l k=1 k=l 
where = Gil, = Cilt-n = 
9. Case n=2. Equations (2.1) in this case consist of four equations. Let 
us choose the axes so that y2=0, then the solution of these equations is 
= — m2/(AM*)"8, = 0,7 


(9.1) 
= m,/(AM*)"3, y2 = 0, 


where M=m +2. 
To compute the elements of A; the following quantities, x; —xz2= —(M/A)™, 


are substituted in (8.1) and (8.2). We find 
— 2dm2/M —» 2\m2/M 0 0 
2m,/M — 2dm,/M 0 0 

0 0 —dm/M —dm/M 

0 0 — dm,/M —dm2/M 


A: 


It is quite evident that A;=0 and it remains to be shown that the minor 
D; in the upper left-hand corner is different from zero. After adding the sec- 
ond column to the first, and then subtracting the first row from the second, 

0 — 3 0 

Since D,+0 for all finite positive values of \, m: and mz the solution of (2.1) 
for m.=0, namely, x,;=0, y:=0, ye=0, remains real and varies 
continuously to the values given in (9.1) for all finite positive values of 
A, m, and mz. 

10. Case n=3. By choosing the axes so that y; =0 the solution of the six 
equations (2.1) is : 
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= 2(ms -+ myms + ms) K, y1 = 0, 
(10.1) = — + myms + 2mym2 — ms) K, 2 3° 
— (2myms + mymz + — m:)K, mK, 


where M =m,+m2+msz, and K 

If the elements of A; are computed in terms of \, m,, mz and mz, As will 
vanish, as can be shown by direct computation, since the system (2.1) is 
dependent and ¥; can be chosen arbitrarily. D; is the minor obtained by leav- 
ing out the 4th row and 4th column of A; and has determinant different from 
zero for all finite positive \, 1, ms, and ms, as a rather long series of com- 
putations will show. 

However, the proof that D; is different from zero for all \, m1, mz and 
sufficiently small ms; is short and will be given here, since it is the same type of 
proof that will be used in the general case. In the notation (8.1) and (8.2), 
Gi2 Gis C11 C12 Cis 
G11 G3 Ci2 C13 


G22 Co1 C22 
a2) C23 


a Cu 


C12 bn 
C21 Coz Caz bes 


Con Cos 


C31 C32 Caz 


After rearranging rows and columns in A; we have 
@i1 C11 @i3 C13 
G22 C21 G23 C23 
«(C12 Cis 
Cor Con C23 


G31 G32 Cai 


C31 


and if m;=0, A; and D; become 


G32 C31 


Car 


530 
C11 
C21 De2 O23 
. 
Mm Cas dss 
:0 
0 
As 
O 
A; = 
:0 
ss 
and 
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0 


0 0 


my 3 —3*/2m_, 


2m, 3 2m, 3 2(m m2) 
M M M 


On expanding (10.3) we obtain 
(10.4) Ds = — 
Since D; of (10.2) is a continuous function of ms; and, as (10.4) reveals, is 


different from zero for positive A, m1, m2, and m3=0, it is different from zero 


for sufficiently small positive ms. 
11. Case n=4. Using the notation (8.1) and (8.2) we have 


i,j = 1,2, 3,4, 


Co1 Co2 Cos Cra bee 


C31 C32 Caz Cee bse 


Car Can Cas Cae ! dae 
After rearranging rows and columns, we have 


G14 C14 
24 C24 
34 
34 O34 


Ca 


G41 Cai Caz Cas Gas Cua 


Car Can Cas Dae das Cas Daa 


Car Car Cas Dar Dae Das Can bug 


For m,=0 these become 


2me 2m. 
-(— 
0 
M M 
ms 
0 
0 0 0 7; 
4M 4 
| 
bi; 
and 
Ci2 Cis C14 
Coz Cos C24 
a; ; 
Css (C34 
D,= 4 Coo Cas |, 1,7 = 1,2, 3, 4. 
bss 
As 2 
A, = 
| 
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0 
0 


G41 Gao Car Cas Cus 


Car Cao Cas Dae Das Cas 


Car Car Cas Dar Dae Das Cas bug 


Since x; and y; are continuous functions of m,, D, is a continuous function 
of m, and is different from zero for m,=0, provided that A, m, and mz are 
positive, that ms; is so chosen that D3;+0, and that (x4, y«) is not a multiple 
solution of the 4th and 8th equations (2.1). Hence, D,0 for sufficiently 
small positive values of m,, and the solution, real for positive A, 11, m2, ms, 
and m,=0, varies continuously and remains real for sufficiently small posi- 
tive values of m4. 

12. Existence of a solution for positive m,. Continuing the process of 
§§9, 10, 11 it is possible for one to choose ms, me, , successively so 
that Ds, De, -- +, Das remain different from zero. In the general case A, 
will be zero due to the dependence of equations (2.1). The minor D, obtained 
from A, by deleting the (#+1)th row and (”+1)th column will be, in the no- 
tation of (8.1) and (8.2) and after rows and columns have been rearranged, 


in Cin 


Gn1 Gn2*** Gnn—1 Cn2 Cnd*** Cn 


Cnt Cn2*** Cn n—1 baa bas bn n—1 


For m,=0 we have 


OF OF Yn 
OF OF 2n/9Yn q 


D, = Da-1 


Since x; and y; are continuous functions of m,, D, is a continuous function 
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: 0 0 
:0 0 
: 0 
0 
A D 0 
A D 0 
leer = 
0 -- 
i an Con 
Qn-1n 
Can bon 
ban 
\ 
| ban 
| 
| 
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of m,; and, if (x,, y,) is not a multiple solution of the mth and (2m)th equations 
(2.1), D, #0 for m,=0. Therefore, D, +0 for sufficiently small values of m,, 
and the solution, real for positive X, mm, m2,---, m,1, and m,=0, varies 
continuously and remains real for sufficiently small positive values of m,. 
This concludes the proof of the following theorem. 


THEOREM 12.1. For arbitrary finite positive \, m, ma, and sufficiently small 
positive ms, m4, - - > , my there exists at least one real solution of equations (2.1). 


It should be noted that, from the Theorem of Lagrange and our remark in 
§10, ms need not be restricted and hence we have the slightly more general 
theorem. 


THEOREM 12.2. For arbitrary finite positive masses m,, m2, m3, arbitrary 
finite angular velocity, and sufficiently small positive masses m4, ms, +++, Mn, 
there exists at least one noncollinear plane permanent configuration. 


13. A special form for the equations. In §12 we showed that under certain 
conditions D,+0. It follows from this result that for any A, m, me, and 
properly chosen ms, m4, - , Mn, equations (2.1) with the (m+1)th deleted 
form an independent set. It is easy to show, by processes similar to those of 
§§8 to 12, that the first and (n+-2)th may be replaced by the center of gravity 
equations (6.1) and (6.2) yielding an independent set of 2n—1 equations 
equivalent to the one just mentioned. 

It is our purpose in this section to put this latter set of 2n—1 equations 
into a form convenient for calculating a solution. We shall then solve them 
for the coordinates in terms of \ and the masses m;. Let the mass ratios be 
1, =p, Mi42/m =o; (¢=1, 2, - ++, m—2) and for the particular solu- 
tion we wish to construct we shall take \ =m. The equations then become 


ai + + D> 22430; = 0, 
yi + + = 0, 


on + ———-o; = 0, 
(r21)* (r2 2+3)* 


where >, denotes >-%-? and >>* denotes the sum over the same j except for 
Let us multiply the next to last equation by y; and the last by —x; and 


13.1 
oe 
i= 3,4,---,m 
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form the sum. The resulting equation, 


(ri)? (ri2)* (r; 244)? 


is used to replace the next to last above. By means of the complex numbers 


we are now able to write the 2n—1 equations (13.1) in the form 
+ + 22440; = 0, 
(r21)* (re 2+;)° 
— + + + 2450; = 0, t= 3,4,--+,m, 
which is convenient for computing the solution. 
In the case » =4 the coordinates may be expressed as power series in the 
mass ratios ¢; and o2 with coefficients which are Laurent series in wu. For n>4 
the coordinates may be expressed as power series in (¢;/3)'/* with coefficients 


Laurent series in p. 
14. Solution for m =4. In this case the seven equations (13.2) are 


o; = 0, 


+ + + = 0, 
+ + + = O, 
— + = 0, 
(r21)* (r23)* F (r24)* 
— — X23 Xays 


(r31)* (rs2)* : (rs4)* 


= 0, 


V1 — Y2 Ys Va 
= 0, 
yet a)! + (ra)? ut (rao? 


(ras)? (ra)? (ra)? 


o, = 0, 


(rai)* (r42)* (r43)* 
Since one of the coordinates is arbitrary we may choose y:=0. The solution 
of these equations may be obtained as follows. With o2=0 the first five equa- 
tions (14.1) form an independent set of the six equations (2.1), and the 
solution functions may be obtained from (10.1) by proper change of parame- 


[November 
(14.1) 
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ters. The solution functions thus obtained and arranged in powers of a; are 


3- 


1 — 


(1 + + 6 (1 + 


31/2 1 + 1/3 
(14.2) n= [o- |, 
B 


i_ 9+ 14 13? 
2 12y 


31/2 


3 


+u 


The last two equations (14.1) miay be obtained from the preceding two 
by the transformation, in two rowed notation, 


|. 


(14.3) X2 x3 X4 Ya 


and then setting o:=0. Hence the solution functions x4, ys of the last two 
equations may be obtained from the solution functions x3, ys except for the 
coefficients of the powers of o:. On substituting 


t= —((1—4)/2)(1 +4)? + 
in the last two equations (14.1) and equating coefficients we find 


5\ 1 = 
= (— + 


3u2 7 


= 
a(u) ( 81. 18) 243. «54 


4 8 


This completes the solution of (14.1) as functions of wu and a. 

In order to obtain the coefficients of oz in the solution functions of (14.1) 
we may use the fact that the system (14.1) remains unchanged by the trans- 
formation (14.3) and that the solution functions must have this same prop- 
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erty. Coefficients of higher powers of the parameters may be computed, and 
herein lies the convenience of the form (14.1). The coefficients of any term 
in the expansion of x1, y2, x2, Xs, Ys, X4, may be determined successively, 
and in this order, by substituting series with undetermined coefficients into 
the equations in the order given in (14.1). 

The solution for »=4, computed as described above and arranged in 
powers of o2, will follow immediately. The coefficients in this series are them- 
selves arranged in powers of o; with coefficients which are Laurent series in p. 


3- 


6 


1—3 1-3 


312 1 1/3 1 1/3 
(0- (1 + ») +f + u) 


9 + + 13,? 


27 «12 81. «36 81 


5(3)/?  1687\ 1 


— (1 + +--- 


2-35 2-3* 


ys (1 + — 


81 5) 


(- 


1 + 
(14.5) 
Ste 1 )-2/8g 
= 
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i- [ 5\ 1 =) 
27 


+{- 


81 36 


(1+ 


5(3)4/2 1 


312 5\1 11 
n= 81 )—+ +3) 
2(3)2 
41(3)"2 3173 
+{( 38 fet). 


15. Points of libration for »=4, 5. In crder to find a point where m;=0 
may be introduced, we must solve the pair of equations given by the last of 
equations (13.2) for the case »=5, namely, 

A A A A 
+ + —~ = 0, 
(rs:)* (r53)* (r54)* 
Ys — ys — Ya 
+ o1 +——o; = 0, 
(r51)* (r52)* (rs3)* (r54)* 
where A =xyj—X 

If o;=0 these equations are precisely those which determine P;, hence Ps 
coincides with P; for o,=0. Furthermore, P;=P; is a solution of equations 
(15.1) for 010, the identity being in yu, o; and a2. In order to obtain a solution 
of (15.1) having the property (in §5) that P,#P3, we let o1=3v*. Upon sub- 
stituting 


(15.1) 


1 
— (1 + + pil(u)y + po(u)r? 
+ +--+ + + - 
31/2 
[(1 + + + q2(u)v? 


+ qs(u)v? +--+ + | 
in (15.1) the p;(u) and g;(u) may:be determined by equating coefficients of y. 


2/3 
_ 
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The functions ¢;(u) and c2(u) are the same as in (14.4). The result is 
Ww 
pi 2 4 u pe 4) = 6 144” 


9+ 144 + 13y? 
pls) = — 


(15.3) 


w= 
= 2” » 3 79" Be 


qs(u) = — (1 + 
2u 
Thus, as o; increases from zero to a positive value, a point of libration, de- 
noted by P;*, branches off from P; and moves to a finite distance from P3. 
We shall use only the upper sign in (15.3). 
The solution of 
Aw Ase ut Ass a+ Ava 
(r01)* (r62)* (r63)* (rea)? 


(re)? (re2)* (r63)* (rea)* 


for the libration point Pe may now be obtained from (15.2) by the trans- 
formation 


2=0, 


(15.4) 


which reduces (15.4) to (15.1). Therefore, as oz increases from zero to a posi- 
tive value, a point of libration P#*, whose coordinates are 


(1 + + + pilu) (=) 


+ po(u) (=)"+ ps(u) (=) 


branches off from P, and moves to a finite distance from P,. 
As the masses m, and mg increase from zero to positive values, the solution 


[a + + + -- + (=) 


x5 X6 V2 v4 Ye 
C1 
(15.5) 
; 
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functions (14.5), (15.2) and (15.5) must be extended to include powers of 
(o3/3)*/* and (o4/3)'*. This can readily be accomplished by the substitution 


Xs % Xe Ye Ya Yo G2 G3 


X2 Yo Yu Vo VB Va MBM OZ Ge Gy Ge 


which leaves the system (13.1), for n=6, unchanged. 

16. The solution functions for any n. It is evident that the process carried 
out in the last section may be repeated until the system contains any finite 
number of masses. Concerning the points of libration at any stage, we have 
the following theorem. 


THEOREM 16.1. As masses mo,-1 and mz, are added to the system, by allowing 
them to increase from zero to some sufficiently small positive values at points Px» 
and Px, respectively, two points of libration P3,,, and Ph. branch off from 
and Py, respectively. 


In regard to the solution functions at any stage the following theorem is 
evident. 


THEOREM 16.2. As masses mox41 and mMoi42 tmcrease from zero to some suffi- 
ciently small positive values, the solution functions x; and y; (t=1, 2, , 2k+2) 
vary continuously and may be expressed as power series in (o2%-1/3)"* and 
(o2%/3)"*, where and o2% are the mass ratios and re- 
spectively. 


As pointed out in §14, all the solution functions for nm =4 are power series 
in o; and a2. This is due to the fact that equations (13.1) are invariant under 


the transformations 

Xs 

i,j = 3,4,---,m, 


where the sign is to be taken “+” if i, j are both even or both odd, otherwise 
“— ” and also to the fact that the solution functions if m =4 are power series 
in For n>4 we note that only. x:, x2 and yz are power series in oi, and 
hence only these three functions are power series in the mass ratios 
01, O2,°**, Gn. Since x5 and ys are power series in (o:/3)/*, it follows 
that x; and y; (¢=3, 4, - are power series in (01/3)"/*, (¢2/3)"*,---, 
(on-2/3)". It should be noted, however, that x; and ys (¢=3, 4,---, m) 
may be power series in (0;/3)"*, (@n-2/3), but 
this solution does not have the property stated in Theorem 5.1, for as mn_2 
vanishes P,_, and P,, coincide. 

Let (¢;/3)'*=1}“. In case the solution functions are power series in 7}”, 
#=1, 2,--+, m—2, the following functions have all the properties stated 
in theorems of §§5, 6, 7, 8 and constitute a solution of (13.1). 
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2/3 


+ 123-1 + Tap 
2/3 


= Kio + Kiad, + Kiad, + Ki2). + 


y2 = Koo + Koad, + + + 
+ vas + 723-1 + + 


+ "tes + >, T2j-1 + 


where >> denotes the sum over j from 1 to the largest integer in (n—2)/2, 
and ).,’ the same sum over j except for j=i. The H’s and K’s are functions 
of u and the nonzero ones are 


= w(1 + Ais = Ais = + 


Hao = — (1+ 4)-*, 
1 — 3p 3 


= = — (1 + 


312 
= — = (1 + 


| 
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1 1 


= — 


Ai+1,2 


3127/1 25 
2 \3 
9+ 14y + 


= = — (1 + 
4u 


(3 + 
4u 
ye 2(3)'/2 29 31/2 7 


Op 27 12 3 


= — = 


= — 


Since (16.1) is a solution of (13.1), and hence of (2.1), for values of 
01, 02, * * * , Gn-2 for which the series converge, and these values may be at- 
tained by choosing m, sufficiently large, we may state Theorem 12.2 as fol- 
lows: 


THEOREM 16.3. Given n—1 arbitrary finite masses it is possible to choose an 
additional mass sufficiently large and an angular velocity sufficiently great, so 
that the n masses form a noncollinear plane permanent configuration. 


17. The polygon formed by the P;. A diagram illustrates clearly the prop- 
erties of a solution. In case  =1 the polygon, Fig. 3a, consists of one point P; 
at the origin. There is one point of libration P;* at a distance (A/m)'*=1 
from P;. As a zero mass mz is placed at P# and allowed to increase to a 
positive value, so that (m2/m,) =u >0, the configuration for m =2 is obtained. 
The polygon now consists of the line segment P,P2, Fig. 3b, with two points 
of libration P;* and P;* not collinear(”) with P; and P:. These points are the 
vertices of equilateral triangles with side P,P:= (1+). 

As a zero mass my is placed at P# and allowed to increase to a posi- 

(") The points of libration collinear with P,P; are obtained by solving the first three equa- 


tions (2.1) for x3, where 2 =3 and y; = y,=y;=m;=0. For fixed order x1, x2, xs this is equivalent 
to Lagrange’s quintic, cf. Lagrange [8, p. 277], or Tisserand [15, p. 155]. 
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tive value, so that (ms;/m:) =o,1>0, the configuration for »=3 is obtained. 
The polygon in this case is the triangle P:P2P3, Fig. 3c, which by the Theorem 
of Lagrange is an equilateral triangle of side (1+y-+0:)'/*. There are at least 
two(*) points of libration P,* and P,*. In general there is an (m+2)-sided 
polygon consisting of m points P; and two libration points P%,, and P%,». 


Y 


prO 


Fic. 3c 


From the solution functions (16.1) we observe an interesting property 
which we state in the following theorem. 


(*) According to Henrichsen [6], if the three masses are equal there are ten points of libra- 
tion, 
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THEOREM 17.1. As the masses m; (i=3, 4,-- m), located respectively at 
points P;, approach zero the points P, cluster about the two libration points for 
the two body problem. 


The geometric properties of these m-sided polygons, whether they are con- 
vex or concave, have not been determined. Furthermore, the possibility of 
removing the restrictions on the mass ratios seems to depend upon whether 
the determinant D,, in §12 is different from zero for all values of the masses. 
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